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A METHOD FOR PLANT PROTECTION AGAINST INSECTS OR NEMATODES 

This invention relates to a method of protecting a plant or 
a part of said plant against insect or nematode infestation 
by one or more insects or nematodes having digestive 
cysteine and/or aspartic proteases, comprising presenting 
to a locus wherein said insect (s) or nematode (s) is to be 
controlled an inhibitory amount of a cysteine and/or 
aspartic protease inhibitor. 

Many vegetables, horticultural and field crops are attacked 
by insect pests. Most plants show some resistance to 
certain insects or nematodes; the resistance can be physi- 
cal or chemical. For example, the hairs on the leaves of 
many plants can stop small insects from getting near enough 
to the surface to chew it . In other cases plants use a 
range of complex molecules to make their tissues unattrac- 
tive or toxic. Control of such phytophagous insects and 
nematodes has traditionally been partially addressed by 
cultural and breeding methods. An effective way to reduce 
these losses is to use crop cultivars having genes for pest 
resistance (see Painter (1951) , Insect Resistance in Crop 
Plants, Macmillan: New York) . Plant breeders have attempted 
to reduce losses caused by insect and nematode attack by 
incorporating resistance genes into their varieties via 
conventional breeding programs. 

Classical approaches to host plant resistance, though 
remarkably successful in some instances, are rather empiri- 
cal . Once "traits" for resistance are discovered they are 
moved into agronomical ly acceptable lines by selection 
procedures. One limitation of the classical approach is 
that the movement of genes for resistance from one plant to 
another is restricted to soecies that can be interbred. 
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Additionally, these types of resistance may be under the 
control of many genes, and so are difficult for the plant 
breeder to exploit. Often resistant varieties have shown a 
yield depression and so have not been economically viable. 
Moreover, . if no resistance can be identified within a 
species or within related species then no improvement in 
insect pest resistance is possible by classical breeding. 

Chemical pesticides have been heavily relied upon to 
control insects and nematodes. These agents typically are 
applied on or banded into the soil, or to plant foliage or 
in bait stations. In spite of the availability of a wide 
range of chemical pesticides, phytophagous insects and 
plant parasitic nematodes remain a serious problem. Many 
chemical pesticides have the disadvantage of requiring 
repeated applications. A major problem in the use of many 
pesticides is the ability of insects to become resistant to 
the applied agents. This phenomenon occurs through selecti- 
on of the most resistant members of the insect population 
during repeated application of the agent. In addition, 
these chemicals are environmentally damaging and polluting 
the water table. A need, therefore, exists for new insect 
control agents, particularly agents that have a mode of 
action different from conventional insecticides and nemati- 
cides . 

As alternatives to synthetic compounds, certain naturally- 
occurring agents have been isolated and developed as 
pesticides. These include plant and microbial secondary 
metabolites and proteins, and natural predators or patho- 
gens of insects or nematodes (including other insects, 
fungi, bacteria, and viruses). Furthermore as recombinant 
DNA technology has advanced, genes from a donor organism 
may be transferred to a recipient organism resulting in a 
new phenotype in the recipient. In the case of transgenic 
plants, this phenotype may be resistance to insect damage 
or nematode infection if the introduced gene encodes a 



WO 98/58068 



PCT/NL98/00352 



- 3 - 

polypeptide, the action of which results in a deleterious 
effect on the pest. Consequently, there is a great interest 
and utility in finding polypeptides that have such an 
effect. Genes for these polypeptides can be used to modify 
organisms,, especially plants and microbes, so that they 
adversely affect the grov/th and development of insect 
pests. A number of such polypeptides have been described 
from Bacillus thuringiensxs , various proteinaceous protease 
and amylase inhibitors, and various plant lectins. 

One physiological system of insects and nematodes known to 
be susceptible to disruption by specific inhibitors is the 
action of digestive proteases. The digestive proteases 
hydrolyze ingested proteins and polypeptides by cleaving 
peptide bonds.- The term "protease" is specifically intended 
to include endopeptidases and exopeptidases of the four 
major catalytic classes: serine proteases, cysteine protea- 
ses, aspartic proteases and metallo proteases (see Laskow- 
ski et al. (1983), Ann. Rev, Biochem . , 49: 593-626). The 
class to which a specific protease belongs can be determi- 
ned by the pH range over which it is active, by its ability 
to hydrolyze specific proteins, by its similarity to other 
well-characterized proteases and by its sensitivity to 
various inhibitors . 

Diverse types of insect and nematode digestive enzymes 
release peptides and amino acids from dietary protein. One 
class of digestive enzymes is the class of cysteine protea- 
ses. The term "cysteine protease" is intended to describe a 
protease that possesses a highly reactive thiol group of a 
cysteine residue at the catalytic site of the enzyme. There 
is evidence that many phytophagous insects and plant 
parasitic nematodes rely, at least in part, on midgut 
cysteine proteases for protein digestion. These include but 
are not limited to Hemiptera, especially squash bugs {Anasa. 
tristis) ; green stink bug {Acrosternum hilare) ; Riptortus 
clavatus ; and almost all Coleoptera examined so far, 
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especially, Colorado potato beetle {Leptinotarsa deaemline- 
ata) ; three-lined potato beetle (Lema trilineata) ; aspara- 
gus beetle {Crioceris asparagi) ; Mexican bean beetle 
{Epilachna varivestis) ; red flour beetle (Tribolium casta- 
5 neum) ; confused flour beetle (Tribolium confusum) ; the flea 
beetles (Chaetocnema spp., Haltica spp. and Epitrix spp.); 
corn rootworm (Diabrotica Spp.) ; cowpea weevil (Callosobru- 
chus maculatus) ; boll weevil (Anthonomus grandis) ; rice 
weevil (Sitophilus oryza) ; maize weevil {Sitophilus zea- 

10 mais) ; granary weevil (Sitophilus granarius) ; Egyptian 
alfalfa weevil (Hypera postica) ; bean weevil (Acanthosceli - 
des obtectus) ; lesser grain borer (Rhyzopertha dominica) ; 
yellow meal worm (Tenebrio molitor) ; Thysanoptera , especi- 
ally, western flower thrips (Frankliniella occidentalis) ; 

15 Diptera, especially, leaf miner spp. (Liriomyza trifolii) ; 
plant parasitic nematodes especially the potato cyst 
nematodes (Globodera spp.), the beet cyst nematode (Hetero- 
dera schachtii) and root knot nematodes {Meloidogyne spp.) 

Another class of digestive enzymes are the aspartic protea- 

20 ses. The term "aspartic protease" is intended to describe a 
protease that possesses two highly reactive aspartic acid 
residues at the * catalytic site of the enzyme and which is 
most often characterized by its specific inhibition with 
pepstatin, a low molecular weight inhibitor of nearly all 

25 known aspartic proteases. There is evidence that many 
phytophagous insects rely, in part, on midgut aspartic 
proteases for protein digestion most often in conjunction 
with cysteine proteases. These include but are not limited 
to Hemiptera especially (Rhodnius prolixus) and bedbug 

30 {Cimex spp.) and members of the families Phymatidae, 
Pentatomidae, Lygaeidae and Belos tomat idae ; Coleoptera, in 
the families of the Meloidae, Chrysomel idae , Coccinelidae 
and Bruchidae all belonging to the series Cucuj if ormia , 
especially, Colorado potato beetle (Lepcinotarsa decemline- 

35 ata) three-lined potato beetle (Lema trilineata) ; southern 
and western corn rootworm (Diabrotica undecimpunctata and 
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D. virgifera) , boll weevil (Anthonomus grand! s) , squash bug 
(Anasa tristis) ; flea beetle {Phyllotreta crucifera) , 
bruchid beetle {Callosobruchus maculatus) , mexican bean 
beetle (Epilachna varivestis) , soybean leafminer (Odontota 
horni) margined blister beetle (Epicauta pestlfera) and the 
red flour beetle (Tribolium castaneum) ; Diptera, especially 
housefly (Musca domestica) (Terra and Ferreira (1994) Como . 
Biochem . Phvsiol > 109B: 1-62; Wolf son and Murdock (1990) J . 
Chem. Ecol , 16: 1089-1102) 

Compounds that form complexes with proteases and inhibit 
their proteolytic activity are widespread in nature. A 
variety of "low molecular weight" protease inhibitors are 
known, largely of non-natural synthetic origin. A number of 
naturally occurring low molecular weight inhibitors have 
been isolated from bacterial and fungal sources and charac- 
terized; this group includes such inhibitors as E64 (N- (L- 
3 - trans carboxyoxiran - 22 - carbomyl ) -L-leucyl - amido- 4 -guani - 
dobutane) , leupeptins, antipains and pepstatins. 

Several proteinaceous protease inhibitors have been isola- 
ted from plant species and are among the defensive chemi- 
cals in plant tissues that are both developmentally regula- 
ted and induced in response to insect and pathogen attacks. 
Inhibitors of serine-, cysteine-, aspartic-, and metallo- 
proteases have been found in plants and especially in 
storage organs such as tubers and seeds. The most common 
and widely studied group of plant protease inhibitors are 
those that inhibit the animal serine proteases, which 
include trypsin and chymotrypsin (see Ryan (1990) Annu . 
Rev. Phytopathol. 28: 425-449). 

Proteinaceous cysteine protease inhibitors decrease or 
eliminate the catalytic activity of a cysteine protease. 
The pH optima of cysteine proteases is usually in the range 
of 3.5-7, which is the pH range in the lumen of midguts of 
insects that use cysteine proteases. Inhibitors of cysteine 
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proteases are dominated by the cystatin family which is 
subdivided into four subfamilies with respect to molecular 
weight, the number of disulphide bonds, subcellular locali- 
zation, and primary structure characteristics. The classi- 
fication system is mainly based on information regarding 
vertebrate and plant cystatins . Cystatins have been tested 
against insects and nematodes both in vitro and in vivo. 

Very few other examples of proteinaceous inhibitors of 
cysteine proteases exist to date. From potato one other 
family of cysteine protease inhibitors is known which 
belongs to the plant Kunitz family of inhibitors and which 
also includes aspartic protease inhibitors (Strukelj (1992) 
Biol. Chem. Hoppe-Seyler 373: 477-482; Krizaj et al . (1993) 
FEBS Letters 333: 15-20). This inhibitor, Kunitz PCPI8.3, 
is a tight inhibitor of Cathepsin L (Ki=0.07 nM) and a good 
inhibitor of papain (Ki=:3.3 nM) . From Diabrotica virgifera 
a completely novel type of thiol protease inhibitor was 
isolated (World Patent WO 95/24479) . This inhibitor bears 
no structural relationship to other known cysteine protease 
inhibitors . 

Recently, a new class of cysteine procease inhibitors 
emerged. These proteins have a type I repeated thyroglobu- 
lin domain in common (Malthiery and Lissitzky (1987) Eur ., 
J. Biochem . 165: 491-498). From humans a protein fragment 
derived from human MHC class II -associated p41 invariant 
chain was isolated (Ogrinc et al . (1993) FEBS Letters 336 : 
555-559; Bevec et al . (1996) J . Exp . Med . 183: 1331-1338). 
It is a tight inhibitor of Cathepsin L (Ki = 0.0017 nM) and a 
good inhibitor of papain (Ki = l . 4 nM) . A similar cysteine 
protease inhibitor with a type I repeated thyroglobul in 
domain was isolated from the eggs of chum salmon (Yamashita 
and Konagaya (1996) J . Biol . Chem . 271: 1282-1284). Final- 
ly, from the sea anemone, Actinia equina, a cysteine 
protease inhibitor designated equistatin was isolated with 
three type I repeated thyroglobul in domains (Lenarcic et 
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al. (1997) J. Biol. Chem. 272: 13899; Lenarcic et al (1998) 
J. Biol. Chem . 273: 12682). 

Apart from human invariant chain, ECI (egg cysteine 
protease inhibitor) and equistatin, parts of other proteins 
also have domains homologous to type I repeated thyroglobu- 
lin domains, including proteins such as rat invariant chain 
(McKnight et al . (1989) Nucleic Acids Res . 17: 3983-3984), 
saxiphilin (Morabito and Moczydlowski (1994) Proc . Natl . 
Acad. Sci. USA 91: 2478-2482), nidogen (Mann et al . (1989) 
EMBO J . 8: 65-72), epithelial glycoprotein (Simon et al . 
(1990) Proc. Natl. Acad. Sci. USA 87: 2755-2759), IGF- 
binding protein-3 (Brewer et al . (1988) Biochem. Biophvs . 
Res, Commun. 152: 1287-1289), testican (Alliel et al . 
(1993) Eur. J. Biochem . 214: 347-350) and entactin (Durkin 
et al. (1988) J. Cell. Biol . 107: 2749-2756) (Fig. 3). For 
entactin and thyroglobulin, it was published that they do 
not inhibit cysteine proteases (Yamashita and Konagaya 
(1996) J. Biol. Chem. 271: 1282-1284) . These proteins do 
contain the conserved sequences and the reason for the lack 
of inhibition is obscure. 

Proteinaceous aspartic protease inhibitors decrease or 
eliminate the catalytic activity of an aspartic protease. 
The pH optima of aspartic proteases is usually in the range 
of 2-5, which is the pH range in the parts of the gut where 
aspartic proteases are active. Very few proteinaceous 
inhibitors of aspartic proteases are known. One well 
characterized family of cathepsin D inhibitors is found in 
potato and related Solanaceae (Strukelj et al . (1992) Biol. 
Chem. Hoppe-Seyler 373:477-482). No in vitro enzymatic 
assay tests or in vivo bioassays have been published on the 
use of potato asparcic protease inhibitors against insects 
or nematodes . 

Australian Patent Application No. 36558/89 teaches that 
animal cystatins {such as hen egg white cystatin and 
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kininogens) and low molecular weight , non- peptide cysteine 
protease inhibitors (such as E-64, antipain and leupeptin) 
may be effective in the control of a variety of Coleoptera 
which utilize cysteine proteases for digestion. 

WO 92/21753 teaches that multicystat in, an 8-domain phyto- 
cystatin from potato, is more effective than other cysta- 
tins in the control of a variety of insects utilizing 
cysteine proteases for digestion of protein in the midgut, 
because it is more resistant to proteolysis by carboxypep- 
tidases . 

WO 96/16173 teaches that modified cystatins can protect 
plants against nematodes . 

WO 95/24479 teaches that a novel thiol protease inhibitor 
isolated from the corn rootworm and designated virgiferin 
can protect plants against insects and nematodes that have 
thiol proteases as digestive enzymes . 

Evidence for these claims was published also in the scien- 
tific literature for different coleopteran and hemipteran 
insects as well as for nematodes (Chen et al . (1992) 
Protein Express Purification 3 : 41-49 ; Edmonds et al . 
(1996) Entomol . Exp . Aopl . 78: 83-94; Elden (1995) J . 
Econ. Entomol . 88: 1586-1590; Orr et al . (1994) J. Insect 
Physiol . 40 : 893-900 ; Kuroda et al . (1996) Biosci . Bio- 
tech. Biochem . 60: 209-212 ; Leple et al . (1995) Molecular 
Breeding 1: 319-328; Urwin et al . (1995) Plant J . 8, 121- 
131) . At high concentrations the cystatins caused mortality 
or reduced the fertility and growth of some insects and 
nematodes (Table 1). However, the cysteine protease inhibi- 
tor concentrations required to achieve agronomical ly 
interesting levels of protection in artificial diets (2 00 - 
2000 (iM , see Table 1) in most cases are much higher than 
can be achieved in transgenic plants (10-40 /iM, see Table 
1 ) and also much higher than the accual protease concent r a - 
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tions which they are expected to inhibit (10-30 pM) . The 
high concentrations are required most likely because they 
do not bind all the different molecular forms of cysteine 
proteases present in the gut tightly enough. Weak inhibi- 
5 tors can still inhibit proteases when they are present in 
large excess to the protease. It is estimated that between 
10 and 30 different proteolytic enzymes are active in the 
gut and the transcription of proteases that are not inhibi- 
ted can be actively induced by the insect to compensate for 

10 the inhibition of other proteases (Jongsma et al . (1995) 
Proc. Natl. Acad. Sci USA 92: 8041-8045; Bolter and Jongsma 
(1995) J. Insect Phvsiol . 41: 1071-1078) . A second reason 
for their lack of toxicity may be that they are unstable in 
the gut environment and degraded by proteases which are not 

15 inhibited. 

The mere fact a protease inhibitor is an inhibitor of 
cysteine or aspartic proteases, therefore, does not neces- 
sarily mean it will be effective in vitro against insects 
utilizing these proteases for digestion (see also WO 

20 92/21753) . In general, it can be said that it is rare that 
any single inhibitor will completely inhibit the entire 
spectrum of cysteine or aspartic protease activity in an 
insect or nematode gut at a normal concentration that can 
be achieved in plants (10-40 ftM) . Inhibitors which at the 

25 same time inhibit both cysteine and aspartic proteases of a 
certain insect have never been described before, even 
though their utility is obvious, as many insects rely on 
the combination of these two classes of proteolytic enzymes 
for digestion. Many of the listed insects in table 1 rely 

30 on both types of proteases and the fact that the inhibitors 
are often not highly toxic to the insects is likely to be 
caused by the fact that the aspartic proceases remain free 
to digest the dietary protein and the cysteine protease 
inhibitors . 
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Table 1. Cysteine protease inhibitors that affect tfitness 
parameters of insects when administered in diets or expres- 
sed in transgenic host plants 



Insect species 


Pi-level 
in 

diet/plant 


Effect 


Reference 


Artificial diets supplemented with cystatins 


Tribolium castaneum (Col.) 


10, 000 


35% WR 


Chen et al . , 
1992 


Hepera postica (Col.) 


200 


RF 


Elden 1995 


Di abro ti ca undecimpuncta ta 
(Col . ; 


100-200 


40-70% M 


Edmonds e t 
al., 1996 


DiaJbrotica undecimpunctata 
(Col . ; 


125 M9/ cm2 


50% WR 


Orr ec al . , 
1994 


Diabrotlca vlrgiiera 
(Col.) 


125 /ig/cm2 


50% WR 


Orr et al . , 
1994 


Callosobruchus chlnensis 
(Col . ) 


100-2, 000 


10-100% M 


Kuroda et al . , 
1996 


Riptortus clavatus (Hem.) 


100-2,000 


0-100 % M 


Kuroda et al . , 
1996 


Transgenic plants expressing cystatins 


Globodera pallida (Nemat.) 


10 

(tomato) 


empty 
cysts 


Urwin et al . , 
19 95 


Chrysomela tremulae (Col . ) 


40 

(poplar) 


40% M 


Leple et al . t 
1995 



WR=weight reduction; RF=reduced fertility; M=mortality 



Prior literature exists which demonstrates that some 
insects like the Colorado potato beetle, are particularly 
insensitive to protease inhibitors , even when they are 
isolated from completely unrelated sources like rice or 
humans (Michaud et al . (1995) Insect Biochem. Molec . Biol . 
25: 1041-1048; Michaud et al . (1996) Archives of Insects 
Biochemistry and Physiology 31: 451-464 ; Michaud et al . 
(1993) FEBS Letters 331: 173-176). Some of these protease 
inhibitors when tested against other insects were found to 
be quite effective (Leple et al . (1995) Molecular Breeding 
1: 319-328). In Colorado potato beetle, however, these 
inhibitors v/ere demonstrated to be either too specific for 
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only one type of protease activity or to be broken down by 
aspartic proteases . 

The structural requirements for a cysteine or aspartic 
protease inhibitor to more effectively inhibit insect or 

5 nematode gut proteases is completely unknown. Plants, 
especially when related to the host plant, are poor sources, 
of effective inhibitors, because insects will have evolved 
protease inhibitor insensitive proteases against these. It 
is possible to test cysteine protease inhibitors from other 

10 sources than plants, but to date no proteinaceous inhibi- 
tors of aspartic proteases other than from Solanaceae have 
been described. The most desirable type of protease inhibi- 
tor for pest control of insects utilizing cysteine and/or 
aspartic proteases for digestion would simultaneously 

15 inhibit more than 90% of both activities in insects that 
have been reared on their host plant in order to specifi- 
cally target the host plant protease inhibitor insensitive 
protease complement. Such inhibitors are not known to the 
art . 

20 It has now been found that cysteine protease inhibitors 
selected from the group of proteins containing at least one 
type I repeated thyroglobulin domain are effective in vivo 
against insects of nematodes utilizing cysteine proteases, 
and surprisingly it has been found that said inhibitors are 

25 particularly active towards insect cysteine proteases which 
are insensitive to host plant derived cysteine protease 
inhibitors. Such a property is unprecedented among other 
types of cysteine protease inhibitors including those of 
non-plant origin. As a result said inhibitors are highly 

30 toxic to for example Colorado potato beetle larvae. 

Accordingly in one aspect, the invention relates to a 
method of protecting a plant or a part of said plant 
against insect or nematode infestation by one or more 
insects or nematodes having digestive cysteine proteases, 
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comprising presenting to a locus wherein said insect (s) or 
nematode (s) is (are) to be controlled an inhibitory amount 
of a cysteine protease inhibitor selected from the group of 
proteins containing at least one type I repeated thyroglo- 
5 bulin domain. 

Further, it has now been found that some type I repeated 
thyroglobulin domains can also be effective against aspar- 
tic proteases. It is very valuable that the activities 
against aspartic proteases and cysteine proteases are 
10 present on similar structural domains and can be combined 
in one protein like equistatin, because it is established 
by the inventors that they act synergistically by more 
completely inhibiting all insect protease activity. 

Accordingly, in a second aspect the invention relates to a 
15 type I repeated thyroglobulin domain inhibitor peptide with 
activity towards aspartic proteases, said peptide having 
the amino acid sequence extending from amino acid position 
68-199 of equistatin of Fig. 1 or a modified type I repea- 
ted thyroglobulin aspartic protease inhibitor peptide 
20 wherein said modified peptide comprises a peptide having 
substantial amino acid identity to amino acid position 68- 
199 of equistatin; truncations of amino acid position 68- 
199 of equistatin; or truncations of the peptide having 
substantial amino acid identity to amino acid position 68- 
25 199 of equistatin, wherein said modified peptide is functi- 
onally equivalent to said amino acid position 68-199 of 
equistatin with aspartic protease inhibitor activity. 

In a third aspect, the present invention relates to an 
insecticidal or nematocidal composition containing a 
30 protein containing at least one type I repeated thyroglobu- 
lin domain, wherein the composition is capable of improving 
the resistance of plant tissue otherwise susceptible to 
infestation by one or more insects or nematodes having 
digestive cysteine and/or aspartic proteases. 
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x Accordingly , the invention provides an agricultural.* compo- 
sition containing a carrier and an insect or nematode 
controlling or combatting amount of a cysteine and/or 
aspartic protease inhibitor as defined herein. 

In a fourth aspect, the invention relates to vectors enco- 
ding and capable of expressing a peptide containing one or 
more type I repeated thyroglobulin domains in a plant cell. 

Accordingly, the invention provides a biologically functio- 
nal expression vehicle containing a promoter effective to 
promote expression of a downstream coding sequence in 
plant cells, a DNA coding region coding for the expression 
in plant cells of protein composed of at least one type I 
repeated thyroglobulin domain and a termination sequence 
effective to terminate, transcription or translation of the 
genetic construction product in plant cells, the genetic 
construction effective to express in the cells of the plant 
insect controlling amounts of the protein containing at 
least one type I repeated thryoglobulin domain. 

Further the invention provides a method of protecting a 
plant or a part of said plant against insect or nematode 
infesctation comprising inserting into the genome of the 
plant a sequence coding for a protein containing at least 
one type I repeated thyroglobulin domain with a promoter 
sequence active in the plant to cause expression of said 
protein at levels which provide an insect or nematode 
controlling amount of said protein. 

In particular, said method comprises the steps of : 

(a) culturing cells or tissues from the plant; 

(b) introducing into the cells or tissue at least one 
copy of a gene coding for the protein containing at least 
one type I repeated thryoglobulin domain; 

(c) regenerating resistant whole plants from the cell 
or tissue culture. 
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In a fifth aspect, the invention relates to transformed 
cells and cell cultures, of cells which possess genes 
encoding a peptide containing one or more type I repeated 
thyroglobulin domains capable of protecting plant tissue 
5 otherwise .susceptible to infestation by one or more insects 
or nematodes having digestive cysteine and/or aspartic 
proteases . 

Further, the invention provides a transgenic plant and its 
sexual progeny resistant to attack by one or more insects 
10 or nematodes having digestive cysteine proteases, said 
transgenic plant expressing an insect of nematode control- 
ling amount of a protein containing at least one type I 
repeated thyroglobulin domain. 

In sixth aspect, the present invention relates to a process 
15 of preparing an insecticidal or nematocidal composition of 
a peptide containing one or more type I repeated thyroglo- 
bulin domains, wherein the composition is capable of 
improving the resistance of plant tissue otherwise suscep- 
tible to infestation by one or more insects or nematodes 
20 having digestive cysteine and/or aspartic proteases. 

A number of aspects of the present invention are further 
illustrated in the accompanying drawings, in which 

Figure 1 shows the nucleotide sequence and deduced amino 
acid sequence of the equistatin gene from Actinia equina L. 

25 Figure 2 shows a comparison of all three domains of the 
cDNA encoded amino acid sequence of equistatin and the 
purified equistatin protein from Accinia equina L . with 
amino acid sequences of other proteins with type I repeated 
thyroglobulin domains with known and unknown protease 

30 inhibitor activity. 
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Figure 3 shows the in vitro effects of a wide range of 
different cysteine protease inhibitors on cysteine protea- 
ses of Colorado potato beetle larvae {Leptinotarsa decemli- 
neata) that are insensitive to the endogenous cysteine 
protease inhibitors of the host plant potato. 

Figure 4 shows the effects of equistatin on growth and 
mortality of Colorado potato beetle larvae 

Figure 5 shows effect of equistatin relative to other 
cysteine protease inhibitors on in vitro proteolytic 
activity of the western flower thrips Frankliniella occi- 
dental is 

Figure 6 shows the effect of equistatin on the fecundity of 
thrips females two days after being placed on a diet 
containing the inhibitor 

Figure 7 shows the plasmid map of pB3 -equistatin 

Figure 8 shows the construction of the plasmid pCABl- 
equistatin 

Figure 9 shows HPLC analyses of equistatin. Chromatograms 
shows the HPLC elution profile of equistatin after incuba- 
tion with different enzymes, In panel A the equistatin was 
incubated with cathepsin D in a final molar concentration 
of 2:1, and in panel 3 the equistatin was fragmented using 
1% (w/w) S-trypsin. Identities of peaks were based on the 
N- terminal sequences'. 

Figure 10 shows electrophoret ic analyses of: equistatin. A, 
SDS - PAGE of dissected equistatin. Lanes: 1 , molecular 
weight standards; 2, first domain of equistatin (eq d-1); 
3, the combined second and third domain of equistatin (eq 
d-2,3) . 3, Native PA.GE of the formation of the equistatin- 
cathepsin D complex. Lanes*. 1, cathepsin D; 2, equistatin 
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and cathepsin D mixed together 30 min before electrophore- 
sis; 3, equistatin. The gels were stained wich Coomassie 
blue . 



Figure 11 shows a schematic diagram of the function of 
5 equistatin fragments used. Sites of proteolytic cleavages 
are indicated by gaps and are shown as amino acid numbers . 
Cleavage sites obtained by the action of S-trypsin are 
indicated by arrows. The pairing of cysteine residues in 
disulphide bond is indicated by horizontal line connecting 
10 cysteine residues. 



Figure 12 shows active site titration of equistatin with 
cathepsin D. Inhibition of 77 nM cachepsin D with increa- 
sing concentrations of native equistatin. Residual activity 
is expressed as percent of control activity in samples 
15 containing no inhibitor. 

The entire teachings of all references cited herein are 
hereby incorporated by reference. 



It has now been determined that among the type I repeated 
thyroglobulin domains, domains exist which are active 

20 towards aspartic proteases of both human and insect origin. 
In combination with proteins (P41 invariant chain fragment 
and equistatin domain I) with domains that are active 
towards cysteine proteases they have potent inhibitory 
activity towards "protease inhibitor insensitive' 1 digestive 

25 cysteine and. aspartic proteases of a broad range of insect 
species belonging to different insect orders including 
Colorado potato beetle, thrips, leaf miner and corn root- 
worm. They were larvicidal when administered enterally to 
the larvae of insects having digestive cysteine and aspar- 

30 tic proteases such as the Colorado potato beetle and 
strongly reduced the fecundity of thrips which mainly 
depend on cysteine proteases for the digestion o£ protein. 
It is shown that this property of cype I repeated thyroglo- 
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bulin domains is unique among a very broad set of cysteine 
and aspartic protease inhibitors that were derived from 
nearly all known types of cysteine and aspartic protease 
inhibitors. It is shown, therefore, that it is not suffi- 
cient, as suggested before, to utilize inhibitors which are 
at a large evolutionary distance from plants, as these were 
equally inactive as plant-derived inhibitors. Instead, 
exclusively the cysteine and/or aspartic protease inhibi- 
tors containing the conserved features of a type I repeated 
thyroglobulin domain were able to fully inactivate the W PI- 
insensitive cysteine and aspartic proteases" of different 
insect species. Thus, this invention provides a method for 
killing insects and nematodes having "protease inhibitor 
insensitive" digestive cysteine and/or aspartic proteases, 
including larvae of Colorado potato beetle, comprising 
administering enterally to the larvae or nematodes a 
larvicidal or nematocidal amount of protein containing one 
or more type I repeated thyroglobulin domains depending on 
whether the insect utilizes one or more classes of protease 
for digestion. 

Definition of terms 

The terms protease inhibitor and proteinase inhibitor are 
considered equivalent . The term "protease inhibitor insen- 
sitive protease" is meant to indicate that such a protease 
is insensitive to host plant protease inhibitors raised in 
defense against the attacking pest, but is not meant to 
exclude that it can be inhibited by protease inhibitors 
isolated from sources other than the host plant. The terms 
insect and larva, although not equivalent when used speci- 
fically should be understood to include both adult and 
larval forms of a species when used generically. Thus, the 
term insect resistance should be understood to include 
resistance to larval forms as wel 1 as adults, and larvici- 
dal materials should be considered insect icidal , particu- 
larly since killing larvae produces a corresponding absence 
of adults. 
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In a preferred embodiment, the present invention is direc- 
ted to cysteine/aspartic protease inhibitors from the sea 
anemone Actinia equina, also referred to as equistatin. For 
purposes of this invention , "equistatin" is meant to 
5 include a protein encoded by a gene having the sequence set 
forth in Figure 1, or a functional derivative thereof. The 
equistatin peptide that was purified from the sea anemone 
Actinia equina showed the presence of three type I repeated 
thyroglobulin domains (Lenarcic et al. (1997) J. Biol. Chem. 

10 272: 13899; Lenarcic et al . (1998) J. Biol, Chem . 273: 
12682) . Screening a cDNA library from Actinia equina with a 
radiolabeled probe obtained by PCR using two degenerate 
primers on total cDNA resulted in a clone with a coding 
sequence containing a signal peptide for secretion and a 

15 mature protein part of three domains of nearly identical 
protein sequence compared to the purified protein. 

primers for the amplification of equistatin cDNA: 

El-degl : CT (A, C , G, T) AC ( A, C, G, T) AA ( A, G) TG (T , C) CA ( A , G) CA (A , G) 

EI-deg2 : ATT (A, G) AC ( A , G , C , T) TG ( A , C , G , T) GG { A , C , G , T) CG (T , C) T- 
20 T ( A , G) AA 

As can be seen from Figures 1 and 2 the mature protein 
component of equistatin is composed of 3 domains that 
appear to have resulted from the duplication of genetic 
material. On the basis of the preliminary cDNA sequence 

25 analysis, several structural isoforms of equistatin may 
occur in the Actinia equina. The 3 domains comprise a 22 kD 
polypeptide. Each domain comprises about 65-68 amino acids, 
with 3 presumed disulphide bonds. Based on the sequences of 
the domains, it is apparent that the protein is a member of 

30 the conserved type I repeated thyroglobulin domain compri- 
sing repeating type I domains. Specifically the domain 
sequences show high conservation of the amino acid sequen- 
ce : Cys- (Xxx) ia . 29 -Pro-Xxx-Cys- (Xxx) 3 -Gly- (Xxx) 3 -Gln-Cys- 
(Xxx) 5 -Cys-Thr-Cys-Val- (Xxx) 3 -Gly- (Xxx) 10 .. 5 -Cys . The three 
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domain inhibitor purified from Actinia equina was proteoly- 
tically cleaved into two major peptides and separated by 
reverse phase HPLC. Determination of the N- termini of both 
fragments allowed them to be located in the sequence. One 
peptide designated eqd-1 consisted of the first domain 
running from residue 1-67, whereas the second peptide 
designated eqd-2,3 contained domains 2 and 3 with residues 
68-199 The intact equistatin molecule could be inhibited 
by only 1 papain and 1 cathepsin D molecule. Inhibition 
assays with Eqd-1 and Eqd-2,3 determined that Eqd-1 could 
only be inhibited by papain and Eqd-2,3 only v/ith Cathepsin 
D. The inhibition constants for the separated domains were 
similar to the intact equistatin molecule. This demonstra- 
ted that even though these domains appear to be structural- 
ly conserved, that the . specificities for proteases has 
diverged to completely different classes of proteases. It 
is not possible with the present evidence to know which 
residues determine this difference in specificities. 

It should be understood that, given the present teachings, 
one may synthesize or isolate substantially pure functional 
derivatives of naturally-occurring equistatin molecules. A 
"functional derivative" of equistatin is a compound which 
possesses a biological actvity that is substantially 
similar to a biological activity of the equistatin molecu- 
le. The term functional derivative is intended to include 
"fragments", or "effectively homologous variants". 

A "fragment" of a molecule is meant to refer to any inhibi- 
tory polypeptide subset of a equistatin molecule. 

An "effectively homologous variant" of a molecule such as 
the equistatin molecule is meant to refer to a molecule 
substantially similar in sequence and function to either 
the entire molecule of to a fragment thereof . For purposes 
of this invention, these molecules are identified when they 
contain the type I repeated thyroglobui in domain. General- 
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ly, the effectively homologous sequences should retain high 
conservation at the naturally occurring positions of the 
conserved sequence Cys- (Xxx) 18 . 29 -Pro-Xxx-Cys- (Xxx) 3 -Gly- 
(Xxx) 5 -Gln-Cys- (Xxx) 6 -Cys-Thr-Cys-Val - (Xxx) : -Gly- (Xxx) l0 ., 5 - 
5 Cys. The two cysteines on either end of the conserved 
sequence are conserved, but they do not have conserved 
positions . They are likely to form structurally important 
di sulphide bridges with any one of the other cysteines , 
however, for which reason they were included. For purposes 

10 of this invention, the structure of one amino acid sequence 
is effectively homologous to a second amino acid sequence 
i'f at least 70 percent, preferably at least 80 %, and most 
preferably at least 90 % of the active portions of the 
amino acid sequence are identical or equivalent . General 

15 categories of potentially equivalent amino acids are set 
forth below, wherein, amino acids within a group may be 
substituted for other amino acids in that group: (1) 
glutamic acid and aspartic acid; (2 ) lysine, arginine and 
histidine; (3) alanine, valine, leucine and isoleucine; (4) 

20 asparagine and glut amine; (5) threonine and serine; (6) 
phenylalanine, tyrosine and tryptophan; and (7) glycine and 
alanine. More importantly and critical to the definition, 
the function of a second amino acid sequence is effectively 
homologous to another amino acid sequence if the second 

25 amino acid conforms to a tertiary structure having the 
capacity to decrease or eliminate the catalycic activity of 
a digestive cysteine and/or aspartic protease. 



As used herein, the term "substantially pure" is meant to 
describe protein containing at least one type I repeated 
thyroglobulin domain which is homogeneous by one more 
purity or homogeneity characteristics . For example, a 
substantially pure equistatin peptide molecule will show 
constant and reproducible characteristics within standard 
experimental deviations for parameters such as molecular 
weight, chromatographic behaviour and the like. The term, 
however, is not meant to exclude artificial or synthetic 



WO 98/58068 PCT/NL98/00352 

- 21 - 

mixtures of the equistatin peptide molecule with other 
compounds . The term is also not meant to exclude the 
presence of minor impurities which do not interfere with 
the biological activity of' the equistatin peptide molecule 
and which may be present, for example, due to incomplete 
purification. A substantially pure equistatin peptide 
molecule may be isolated from the source in which it 
naturally exists by any appropriate protein purification 
technique. Exemplary techniques include chromatographic 
techniques, such as gel filtration liquid chromatography, 
ion exchange chromatography, affinity chromatography, high 
performance liquid chromatography, reverse phase chromato- 
graphy or by use of immunological reagents employing anti- 
equistatin antibodies. 

Gene isolation 

It is possible to synthesize in vitro an equistatin peptide 
from the constituent amino acids (see Merrifield (1963), J . 
Amer. Chem. Soc . 85:2149-2154; and Solid Phase Peptide 
Synthesis (1969), (eds.) Stewart and Young). The peptides 
thus prepared may be isolated and purified by procedures 
well known in the art (see Current Protocols in Molecular 
Biology (1989), (eds) . Ausubel , et al . and Sambrook et al . 
(1989), Molecular Cloning: A laboratory Manual) . 

Although it is possible to determine and synthesize the 
entire amino acid sequence of the equistatin peptide, it is 
preferable to isolate the entire sequence of the equistatin 
gene. DNA encoding an equistatin peptide may be prepared 
from chromosomal DNA, cDNA or DNA of synthetic origin by 
using well-known techniques. 

Genomic DNA encoding an equistatin peptide may be isolated 
by standard techniques (Sambrook et al . (1989), supra) . 
Specifically comprehended as part of this invention are 
genomic DNA sequences encoding allelic variant forms of the 
equistatin gene, as v/ell as its 5' and 3' flanking regions. 
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It is also possible to use primers and exponentially 
amplify DNA in vitro using sequence specified oligonucleo- 
tides by the polymerase chain reaction (PCR) (see Mullis et 
al . (1987), Meth. Enz . , 155:335-350; Horton et al . (1989), 
5 Gene , 77:61; and PCR Technology : Principles and Applicati- 
ons for DNA Amplification , (ed.) Erlich (1989) . 

cDNA preparations are ligated into recombinant vectors to 
form a gene library. Alternatively, the cDNAs may be 
expressed in a vector such as lambda gtll and the library 
10 screened using antibodies against the equistatin peptide 
molecule . 

A suitable oligonucleotide, set of oligonucleotides or PCR- 
derived DNA fragments may be used, by techniques well known 
in the art, to screen the genomic DNA or cDNA libraries. To 

15 facilitate the detection of the desired sequence, the DNA 
probe may be labeled with any material having detectable 
physical or chemical property. General procedures for 
isolating, purifying and sequencing the desired sequences 
are well known in the art (see Current Protocols in Molecu- 

20 lar Biology (1989), supra: and Sambrook et al . (1989), 
supra) . 



An alternative way of obtaining a genetic sequence which is 
capable of encoding the protein containing at least one 
type I repeated thyroglobulin domain is to prepare it by 

25 oligonucleotide synthesis, after the gene sequence of 
interest is determined (see Caruthers (1983) , In: Methodo- 
1 oqv o f DNA and RNA , (ed.) Weissman) ; Beaucage et al . 
(1981), Tetrahedron Letters , 22; 1859-1962). A series of 
oligonucleotides may be synthesized in order co provide a 

30 series of overlapping fragments which when annealed and 
ligated will produce both strands of the gene. These 
fragments are then annealed and ligated together using well 
known techniques (see Sambrook et al . (1982), supra). 
Alternatively, the gene may be produced by synthesizing a 



WO 98/58068 PCT/NL98/00352 

- 23 - 

primer having a so-called "wagging call" , that does not 
hybridize with the target DNA; thereafter, the genomic 
sequences are amplified and spliced together by overlap 
extension (see Horton et al . (1989), Gene , 77:61-68) The 
5 resulting DNA fragment with the predicted size is isolated 
by electrophoresis and ligated into a suitable cloning 
vector for amplification and further manipulation (see 
Mullis et al . (1987) , supra; and PCR Technology: Principles 
and Applications for DNA Amplification , supra) . 

10 Of course, one may incorporate modifications into the 
isolated sequences including the addition, deletion, or 
non- conservative substitution of a limited number of 
various nucleotides or the conservative substitution of 
many nucleotides, provided that the proper reading frame is 

15 maintained. Translational stop and start signals ared added 
at the appropriate points, and sequences to create conve- 
nient cloning sites are added to the ends. Exemplary 
techniques for modifying oligonucleotide sequences include 
using polynucleotides -mediated, site -directed mutagenesis 

20 (see Zoller et al . (1984), DNA, 3:479-488); Higuchi et al . 
(1988), Nucl. Acids Res. , 16:7351-7367 ; Ho et al . (1989), 
supra; and PCR Technology: Principles and Applications for 
DNA Amplification , (ed.) Erlich (1989)). 

Gene expression 

25 In order to further characterize such genetic sequences, it 
is desirable to introduce the sequence inco a suitable host 
to express the proteins which these sequences encode, and 
confirm that they possess characterise ics of the equistatin 
peptide molecule. Techniques for such manipulation are 

30 well-known in the art and disclosed by Sambrook et al . 
(19S9) , supra. 

Vectors are available or can be readily prepared for 
transformation of viruses, prokaryocic or eukaryotic cells. 
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In general, plasmid or viral vectors should contain *all the 
DNA control sequences necessary for both maintenance and 
expression of a heterologous DNA sequence in a given host . 
Such control sequences generally include a promoter sequen- 

5 ce , a transcriptional start or leader sequence , a DNA 
sequence coding for translation start-signal codon, a 
translation terminator codon, and a DNA sequence coding for 
a 3' non-translated region containing a signals controlling 
termination of RNA synthesis and/or messenger RNA modifica- 

0 tion. Finally, the vectors should desirably have a marker 
gene that is capable of providing a pheno typical property 
which allows for identification of host cells containing 
the vector, and, in the case of monocot transformation, an 
intron in the 5' untranslated region, e.g., intron 1 from 

5 the maize alcohol dehydrogenase gene that enhances the 
steady state levels of mRNA. 



Exemplary host cells include prokaryotic and eukaryotic 
organisms. The appropriate procedure to transform a selec- 
ted host cell may be chosen in accordance with the host 
cell used. Based on the experience to date, there appears 
to be little difference in the expression of genes, once 
inserted into cells , attributable to the method of trans - 
formation itself . 



Conventional technologies for introducing biological 
material into host cells include electroporat ion [see 
Shigekawa and Dower (1988), Biotechniaues , 6:742 ; Miller, 
et al. (1988), Proc. Natl. Acad. Sci . USA . 85:856-860; and 
Powell, et al (1988), AddI . Environ . Microbiol . , 54:655- 
6 60 ) ; direct DNA uptake mechanisms (see Mandel and Higa 
(1972) , J . Moi . Biol . , 53:159-162 ; Dicyackin, et al . 
(1972), Biochimica et Bioohvsica Acta . 281:319-323; Wigler, 
et al. (1979), Cell , 16:77; and Uchimiya, et al . (1982), 
In: Proc. 5 th Intl. Cong. Plant Tissue and Cell Culture . A. 
Fujiwara (ed.), Jap. Assoc. for Plane Tissue Culture, 
Tokyo, pp. 507-508); fusion mechanisms [see Uchidax, et al . 
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(1980), In: Introduction of Macromolecu3.es Into Viable 
Mammalian Cells , C. Baserga, G. Crose, and G. Rovera (eds.) 
Wistar Symposium Series, Vol. 1, A. R. Liss Inc., NY, pp. 
169-185]; infections agents [see Fraley, et al (1986), CRC 
5 Crit. Rev, Plant Sci . , 4:1-46); and Anderson (1984), 
Science, 226:401-409]; microinjection mechanisms (see 
Crossway, et al (1986), Mol . Gen. Genet . . 202:179-185] and 
high velocity projectile mechanisms [see EPO 0 405 696] . 

Transf ormants are isolated in accordance with conventional 
10 methods, usually employing a selection technique, which 
allows for selection of the desired organisms as against 
unmodified organisms. Generally, after being transformed, 
the host cells are grown for about 43 hours to allow for 
expression of marker genes. The cells are then placed in 
15 selective and/or screenable media, where untransf ormed 
cells are distinguished from transformed cells, either by 
death or a biochemical property. The selected cells can be 
screened for expression of the equistatin peptide molecule 
or functional derivatives thereof by assay techniques such 
20 as immunoblot analysis, inhibitory activity assay, enzyme- 
linked immunosorbent assay, radioimmunoassay, or fluores- 
cence-activated cell sorter analysis, immunohistochemistry 
and the like. The transformed tissues are then tested for 
insect controlling activity. 

25 A host cell may be transformed to provide a source from 
which significant quantities of the vector containing the 
gene of interest can be isolated for subsequent introducti- 
on into the desired host cells or for which significant 
quantities of the protein may be expressed and isolated. 

30 Exemplary recombinant host cells include unicellullar 
prckaryotic and eukaryotic strains. Prokaryocic microbes 
that may be used as hosts incluse Escherichia coli, and 
other Enterobacteriaceae , Bacilli, and various Pseudomonas . 
Common eukaryotic microbes include Sacchromyces cerevisiaa 

35 and Pichia pastoris. Common higher eukaryotic host cells 
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include Sp2/0 or CHO cells. Another preferred host is 
insect cells, for example Drosophila larvae, in which the 
vector contains the Drosophila alcohol dehydrogenase 
promoter . 

5 Alternatively, baculovirus vectors, e.g., Autographs 
californica nuclear polyhedrosis virus (see Miller et al . 
(1933), Science, 219:715-721) may be engineered to express 
large amounts of the equistatin peptide molecule or functi- 
onal derivatives thereof in cultured insects cells (see 
10 Andrews et al . (1988), Biochem J. , 252:199-206. 

Agricultural composition 

The present invention provides an agricultural composition 
for application to plants or parts thereof which are 
susceptible to infestation by insects or nematodes having 

15 digestive cysteine proteases, said agricultural composition 
comprising an protein containing at least one type I 
repeated thyroglobulin domain. Often the agricultural 
composition will contain an agriculturally acceptable 
carrier. By the term "agriculturally acceptable carrier" is 

20 meant a substance which may be used to dissolve, disperse 
or diffuse an active compound in the composition without 
impairing the effectiveness of the compound and which by 
itself has no detrimental effect on the soil, equipment, 
crops or agronomic environment . 

25 The agricultural compositions may be applied in a wide 
variety of forms including powders, crystals, suspensions, 
dusts, pellets, granules, encapsulations, microencapsulati- 
ons, aerosols, solutions, gels or other dispersions. In 
addition to appropriate liquid or solid carriers, compos i - 

30 tions may include adjuvants, such as emulsifying and 
wetting agents, spreading agents, dispersing agents, 
adhesives or agents which stimulate insect feeding accor- 
ding to conventional agricultural practices. Adjuvants for 
the formulation of insecticides are well known to those 

35 skilled in the art. 
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The concentration of protein containing at least one type I 
repeated thyroglobulin domain will vary widely depending 
upon the nature of the particular formulation, particularly 
whether it is a concentrate or it is to be used directly. 
The protein containing at least one type I repeated thyro- 
globulin domain generally will be present in at least . 1 
percent by weight and may be up to 100 percent by weight. 

The presentation of the agricultural composition may be 
achieved by external application either directly or in the 
0 vicinity of the plants or plants parts. The agricultural 
compositions may be applied to the environment of the 
insect pest(s). e.g., plants soil or water, by spraying, 
dusting, sprinkling, or the like. 

The present invention further contemplates using recombi- 
5 nant hosts (e.g., microbial hosts and insect viruses) 
transformed with a gene encoding the protein containing at 
least one type I repeated thyroglobulin domain and applied 
on or near a selected plant or plant part suceptible to 
attack by a target insect. The hosts are selected capable 
20 of colonizing a plant tissue suceptible to insect infesta- 
tion or of being applied as dead or non-viable cells 
containing the protein containing at least one type I 
repeated thyroglobulin domain. Microbial hosts of particu- 
lar interest will be the prakaryotes and the lower eukaryo- 
25 tes, such as fungi. 

Characteristics of microbial hosts for encapsulating an 
protein containing at least one type I repeated thyroglobu- 
lin domain include protective qualities for the protein, 
such as thick cell walls, pigmentation, and intracellular 
30 packaging or formation of inclusion bodies; loaf affinity; 
lack of mammalian toxicity; attractiveness to pests for 
ingestion; ease of killing and fixing without damage to the 
protein containing at least one type I repeated thyroglobu- 
lin domain; and the ability to be treated to prolong the 
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activity of the protein containing at least . one »type I 
repeated thyroglobulin domain. Characteristics of microbial 
hosts for colonizing a plant include non-phytotoxicity ; 
ease of introducing a genetic sequence encoding an protein 
5 containing at least one type I repeated thyroglobulin 
domain, availability of expression systems, efficiency of 
expression and stability of the insecticide in the host. 



Illustrative prokaryotes, both Gram-negative and -positive, 
include Entierobacteriaceae , such as Escherichia; Bacillace- 

10 ae; Rhizohoceae, such as -Rhizobium and Rhizobacter; Spiril- 
laceae (such as photobacterium) t Zymomonas , Serratia , 
Aeromonas, Vibrio, Desulfovibrio, Spririllum, Lactobacilla - 
ceae; Pseudomonadaceae (such as Pseudomonas and Acetobac- 
ter) Azotobacteriaceae and Nitrobacteriaceae . Among eukary- 

15 otes are fungi (such as Phycomycetes and Ascomycetes) , 
which includes yeast (such as Saccharomyces and Schizosac- 
charomyces) ; and Basidiomycetes yeast (such as Rhodotorula , 
Aureobasidium, Sporobolomyces) and the like. 

The present invention also contemplates -the use of a 
20 baculovirus containing a gene encoding an protein contai- 
ning at least one type I repeated thyroglobulin domain. 
Baculoviruses including those that infect Heliothis vires- 
cens (cotton bollworm) , Orgyla psuedotsugaca (Douglas fir 
tussock moth) , Lymantria dispar (gypsy moth) , Autographica 
25 californica (Alfalfa looper) , Neodiprion serf iter (European 
pine fly) and Laspeyresia pomonella (coddling moth) have 
been registered and used as pesticides (see US 4, 745, 051 
and EP 175 852) . 



The recombinant hose may be formulated in a variety of 
30 ways. It may be employed in v/et table powders, granules or 
dusts, or by mixing with various inert materials, such as 
inorganic minerals (phyllosilicates , carbonates, sulfates, 
phosphates, and the like) or botanical materials (powdered 
corncobs, rice hulls, walnut shells, and the like) . The 
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formulations may include spreader- sticker adjuvants; 
stabilizing agents, other insecticidial additives surfac- 
tants, and bacterial nutrients or other agents to enhance 
growth or stabilize bacterial cells. Liquid formulations 
may be aqueous -based or non-aqueous and employed as foams, 
gels, suspensions, emulsifiable concentrates, or the like. 
The ingredients may include rheological agents, surfac- 
tants, emulsifiers, dispersants, or polymers. 

Transgenic plants 

Alternatively, the protein containing at least one type I 
repeated thyroglobulin domain can be incorporated into the 
tissues of a susceptible plant so that in the course of 
infesting the plant the insect consume insect -controlling 
amounts of the selected protein containing at least one 
type I repeated thyroglobulin domain. This method offers 
particular advantages to reach plant tissues digested by 
insects or nematodes that are normally very hard to reach 
by conventional application of pesticides. In addition, 
there are important economical and environmental benefits 
to be gained when the need to apply pesticides can be 
reduced . 

The method also offers advantages when the potential for 
insects becoming resistant is considered. Heavy application 
of insecticidal materials generally to a field or a geo- 
graphical area by dust or spray or by soil incorporation 
tends to impose strong selection pressures, since insects 
have no "safe havens" where non-resistant individuals can 
survive. However, many insect pests o£ crop plants also 
attack non-crop species. Limiting the insecticidal materi- 
als to the crop plants in the region by expressing the 
insecticidal materials only in those plants permits conti- 
nued survival of non-resistant insects in associated weed 
plants which provide noc only "safe havens" from the toxic 
compound but food for the insects. This reduces slection 
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pressure significantly and this slows development and 
spread of resistant insects. 

This method also offers advantages from the standpoint of 
soil and groundwater contamination, since no application 

5 vehicle is required. The insect icidal components themselves 
are of natural origin and break down naturally when the 
plant is digested or decomposes. The method offers further 
advantages from the standpoint of cost, since no applicati- 
on expense is involved and the cost of the insecticidal 

10 materials is factored into the price of the seed or other 
reproductive material which the grower purchases. 

One method of doing this is to incorporate the protein 
containing at least one type I repeated thyroglobulin 
domain in a non-phytotoxic vehicle which is adapted for 

15 systemic administration to the susceptible plants. However, 
since the -genes which code for protein containing at least 
one type I repeated thyroglobulin domain may be isolated, 
the invention contemplates, in a preferred embodiment, 
transgenic plants which are capable of biologically synthe- 

20 sizing proteins containing at least one type I repeated 
thyroglobulin domain to provide the plants with a new, or 
an additional, mechanism of protection again.st attack by 
insects or nematodes. 

The invention provides methods of imparting resistance to 
25 insect infestation by insects having digestive cysteine 
and/or aspartic proteases to plants of a susceptible taxon, 
comprising: (a) culturing cells or tissues from at least 
one plant from the taxon; (b) introducing into the cells of 
the cell or tissue culture a structural gene encoding an 
30 protein containing at least one type I repeated thyroglobu- 
lin domain operably linked to plant regulatory sequences 
which cause expression of the gene in the cells, and (c) 
regenerating insect -resistant whole planes from the cell or 
tissues culture. 
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The expression of uniquely high quantities of proteins 
containing at least one type I repeated thyroglobulin 
domain may be deleterious to the plant itself. The use of 
signal sequences to secrete or sequester in a selected 
organelle allows the protein to be in a metabolically inert 
location until released in the gut environment of an insect 
pathogen. 

The DNA sequence will generally be one which orginates 
from, or has substantial sequence homology to an protein 
containing at least one type I repeated thyroglobulin 
domain, originating from an organism different from that of 
the target organism. 

Optimal expression in plants 

In order to optimize the transcriptional and translational 
efficiency of such systems, it is possible to examine the 
frequency of codon usage and determine which codons are, in 
essence, preferred within the transcriptional and transla- 
tional systems normally present in that plant. Using such 
preferred usage codons, it is possible to construct a 
protein coding sequence which may result in a significantly 
enhanced level of transcriptional and translational effi- 
ciency of the equistatin gene or a functional derivative of 
that gene compared to what would be achieved by taking the 
coding sequence directly in an ummodified form of the donor 
organism. In addition the coding sequence may be optimized 
further by removing potential plant poly-adenylat ion 
signals, cryptic splicing sites and mRNA instability motifs 
as was shown for Bacillus thuringiensis toxin genes. 

Generally, the insertion of heterologous genes appears to 
be random using any transformation technique; however, 
technology currently exists for producing plants with site 
specific recombination of DNA into plant cells (see WO/910- 
9957) . The activity of the foreign gene inserted into plar.c 
cells is dependent upon the expression characteristics of 
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the individual inserted genes, resulting from control 
regions (promoters, poly-adenylation regions, enhancers, 
etc.) and from the influence of endogenous plant DNA 
adjacent the chimeric insert and by the copy number. 

5 The promoter selected should be capable of causing suffi- 
cient expression to result in the production of an insect 
controlling amount of protein. Suitable promoters may 
include both those which are derived from a gene which is 
natually expressed in plants and synthetic promoter sequen- 

10 ces which may include redundant or heterologous enhancer 
sequences. A number of promoters which are active in plant 
cells include the nopaline synthase, octopine synthase and 
mannopine synthase promoters from the tumor- inducing 
plasmids of Agrobacterium tumefaciens . The present inventi- 

15 on contemplates constitutive promoters such that the 
transformed plant has increased tolerance to insect pests . 
Examples of constitutive promoters include the CaMV 19S and 
35S promoters (JP 63287485) , ubiquitin promoter, the rice 
actin promoter (WO/ 9109948) . 

20 In species which produce a native protein containing at 
least one type I repeated thyroglobul in domain which is not 
produced in or not distributed to tissues which are normal- 
ly infested wich the insects, a tissue specific promoter 
can be used to provide localized expression of or overpro- 
25 duction of the protein containing at least one type I 
repeated thyroglobulin domain. Examples of tissue specific 
promoters include the root specific promoters such as maize 
metallothionein (EP 452269) , the root specific promoter 
(WO/9113992) the plant seed storage body promoter (WO/5113- 
30 993), and the alcohol dehydrogenase - 1 promoter. Promoters 
known to be light inducible include the promoter of the 
gene encoding the small subunit (ss! of the r ibulose - 1 , 5 , - 
bisphosphate carboxylase from soybean and the promoter of 
the gene encoding the chlorophyll a/b binding protein in 
35 greening leaves (Coruzzi et al . , (1983), J . Biol . Chem. , 
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258:1399;. and Dunsmuir, et al . (1983), J. Molecular and 
Add . Gen . , 2:285; Nap et al . (1993) Plant Molec. Biol . 23: 
605-512 . 

Finally, a wound or pathogen inducible promoter can be used 
co provide expression of the proteins containing at least 
one type I repeated thyroglobulin domain when a tissue is 
attacked by a plant pest. Examples of wound or pathogen 
inducible promoters include the proteinase inhibitor II 
promoter. 

Plant vectors 

Suitable vectors for transforming plant tissue and proto- 
plasts have been described in the literature and are set 
forth herein (see deFrammond et al. (1983), Biotechnology , 
1: 262; An et al. (1985), EMBO J . 4: 277; Potrykus et al - 
(1985), Mol. Gen. Genet. 199: 183; Rothstein et al 1937), 
Gene , 53: 153; WO 90/08829 and WO 84/02913; and, in a 
preferred embodiment, pCABl (as described in the Examples) . 
It is not necessary in practice that the vector harboring 
the selectable marker gene also contain the gene of inte- 
rest. Rather, co-transformation of such vectors may be used 
to transform plant cells. 

Transformation procedure 

The appropriate procedure to produce mature transgenic 
plants may be chosen in accordance with the plant species 
25 used. Regeneration varies from species to species of 
plants. Efficient regeneration will depend upon the medium, 
on the genotype and on the history of the culture. Once 
whole plants have been obtained, they can be sexually or 
clonal ly reproduced in such a manner that at least one copy 
30 of the sequence is present in the cells of the progeny of 
the reproduction. Such procedures may be chosen in accor- 
dance with the plant species used. 



ft 
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Breeding transgenic plants * 
Mature plants, grown from the transformed plant cells, may- 
be selfed to produce an inbred plant. In diploid plants, 
typically one parent may be transformed and the other 

5 parent may be the wild type. The parent will be crossed to 
form first generation hybrids (Fj) , which are selfed to 
produced second generation hybrids (F 9 ) . F 2 hybrids with 
the genetic makeup of the protein containing at least one 
type I repeated thyroglobulin domain are chosen and selfed 

10 to produce an inbred plant. 

Conventional plant breeding methods can be used to transfer 
the equistatin structural gene or a functional derivative 
thereof via crossing and backcrossing. Such methods compri- 
se the further steps of (a) sexually crossing the insect- 

15 resistant plant with a plant from the insect-susceptible 
variety; (b) recovering reproductive material from the 
progeny of the cross; and (c) growing insect-resistant 
plants from the reproductive material . Where desirable or 
necessary, the agronomic characteristics of the susceptible 

20 variety can be substantially preserved by expanding this 
method to include the further steps of repetitively (d) 
backcrossing the insect-resistant progeny with insect- 
susceptible plants from the susceptible variety; and (e) 
selecting for expression of insect resistance (or an 

25 associated marker gene) among the progeny of the backcross, 
until the desired percentage of the characteristics of the 
susceptible variety are present in the progreny along with 
the gene imparting insect resistance. Subsequently, the 
inbreds according to this invention may be crossed with 

30 another inbred line to produce the hybrid. 



Potent iatiors 

The present invention further contemplates using, with the 
protein containing at least one type I repeated thyroglobu- 
lin domain, adjuvants, chemical or biological additives in 
35 an effort to expand the spectrum of targeted pests, to 



WO 98/58068 



PCT/NL98/00352 



- 35 - 

extend the duration of effectiveness of the protein ^contai- 
ning at least one type I repeated thyroglobulin domain or 
to help stabilize the agricultural composition of the 
protein containing at least one type I repeated thyroglobu- 
lin domain. 

Exemplary potentiators would include lectins, amphipathic 
proteins or complementary protease inhibitors. For example, 
the presence of more than one defensive protein, in the 
presence of other defensive proteins, may have an important 
role in the plant defense against insect attacks. It is 
known that Hemiptera and Coleoptera insects developed 
alternative pathways of protein digestion of foods contai- 
ning high levels of certain protease inhibitors. It may be 
advantagous to include inhibitors from families such as the 
Cystatins (Type I to III and the phytocystatins) , Kunitz- 
type inhibitors, Virgiferin inhibitors, Bowman -Birk inhibi- 
tors, Barley Trypsin inhibitors, Potato inhibitors I and 
II, Squash inhibitors, Ragi 1-2/Maize bifunctional inhibi- 
tors, carboxypeptidase A and B inhibitors and aspartic 
protease inhibitors (see Ryan (1990), Annu . Rev. Phvtooa- 
thol . , 28:425-49. 

Insects and nematodes 

The present invention contemplates protecting any plant of 
a taxon which is susceptible to infestation and damage by 
insects or nematodes having digestive cysteine and/or 
aspartic proteases. Such insect pests include in particular 
Coleopteran insects of the families Tenebrionidae, Curculi- 
onidae, Bruchidae and Chrysomel idae , Thysanopteran insects 
and Dipteran insects. Such plant parasitic nematodes 
include Globodera pallida, Heterodera schachtii and Meloi- 
dogyne incognita. Mention is made of insects of especially 
Leptinotarsa dacemlinea ta, Frankliniella occidentalis , 
Diabrotica virgifera and Liriotnyza crifolii commonly 
referred to as the Colorado potato beetle, Western flower 
thrips, western corn rootworm and leafminer. Other specific 
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insects include southern corn rootworm, Mexican been 
beetle, red flour beetle, confused flour beetle, cowpea 
beetle, boll weevil, rice weevil, maize weevil, granary 
weevil, lesser grain borer, flea beetles, Egyptian alfalfa 
5 weevil, bean weevil, yellow mealworm, asparagus beetle, 
squash bug. By the term "taxon" herein is meant a unit,, a 
botanical classification of genus or lower. It thus inclu- 
des genus, species, cultivars, varieties, variants and 
other minor taxonomic groups which lack a consistent 
10 nomenclature. 

Plants 

Exemplary plants include potato, maize, tomato, sorghum, 
cotton, soybean, dry beans, rape, alfalfa, asparagus, sweet 
potato and chrysanthemum. However, it is not to be const.ru- 

15 ed as limiting, in as much as these insects may infest 
certain other crops. Thus, the methods of the invention are- 
readily applicable to numerous plant species, if they are 
found to be susceptible to the plant species listed herei- 
nabove, including without limitation, species from the 

20 genera Medicago, Tri folium, Vigna, Citrus, Daucus , Arabi - 
dopsis, Brassica, Raphanus, Sinapis, Capsicum, Lycopersi- 
con, Nicotiana, Solarium, Helianthus , Bromus, Asparagus, 
Panicurn, Pennisetum, Cucumis, Glycine, Lolium, Triticum and r 
Zea . 

25 Examples 

The present invention is illustrated in further detail by 
the following examples. The examples are for the purposes 
of illustration only, and are not to be construed as 
limiting the scope of the present invention . All DMA 
30 sequences are given in the conventional S' co 3' direction. 
All amino acid sequences are given in convene ional amino 
terminus to carboxylic acid terminus direction. In carrying 
out the following examples, all DMA manipulations were done 
according to standard procedures, unless otherwise indica- 
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ted. See Sambrook et al . (1989), Molecular Cloning : A 
Laboratory Manual , published by Cold Spring Harbor Labora- 
tory (U.S.A. ) 

Example 1 : 

Activity of domain 2-3 of equistatin towards cathepsin D 
type aspartic proteases. 

Equistatin was isolated from sea anemone A . equina by 
a procedure that was ' described previously utilizing its 
inhibitory activity toward cysteine proteinase, papain 
(Lenarcic et al . (1997) J. Biol. Chem. 272: 13899). Besides 
cysteine proteinases, equistatin was screened for the 
inhibition of two other classes of proteinases, aspartic 
proteinases (cathepsin D) and serine proteinase (trypsin) . 
The inhibitory effect of equistatin was observed only when 
it was reacting with cathepsin D. The obtained inhibition 
was unexpected since equistatin is known as a strong 
inhibitor of papain-like cysteine proteinases. In addition, 
it was reported that p41 fragment does not have any inhibi- 
tory effect on cathepsin D (Bevec et al . (1996) J . Exo . Med 
183: 1331-1338). To ascertain that equistatin does not act 
as a substrate for cathepsin D we incubated both in differ- 
ent molar concentrations for different period of time and 
the mixtures subjected on the reverse -phase HPLC system 
(Fig. 9A) . No degradation products v/ere observed for 
cathepsin D. On the contrary, equistatin was found to be a 
good substrate for trypsin and this fact was used for the 
separation of thyroglobulin type-1 domains by a limited 
proteolysis with ^-trypsin. 500 /ig equistatin was incubated 
with 5^tg of S-trypsin in 0.5 ml o£ 0.1 M Tris/HCl buffer pH 
8.0, for 40 min at 37°C. Reaction was stopped by the addi- 
tion of tr if luoroacecic acid. The I?.- cryps ir. digest of 
equistatin v/as separated by high performance liquid chroma- 
tography (Milton Roy Co.) using a reverse phase Vydac CIS 
column equilibrated with 5% acetonitril containing 0.1% 
(v/v) trif luoroacet ic acid. Elution was performed using a 
linear gradient of 80% (v/v) acetonitrile containing 0.1% 
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(v/v) trif luoroacetic acid. Absorbance was monitored at 215 
nm. Two major peaks were obtained on the reverse -phase HPLC 
(Fig.9B). The molecular weights, estimated by the SDS PAGE 
under non-reducing conditions, were about 7,000 and 14,000 

5 (Fig. 10). Determination of the N-termini of the both 
fragments allowed them to be located in the sequence of the 
equistatin molecule shown schematically in Fig. 11. The C- 
termini of the fragments were not identified directly, but 
their size, as indicated by SDS PAGE, were consistent with 

10 their being a single and a double domain. The smaller 
fragment starts with the N-terminus of the equistatin and 
therefore corresponds to the first domain (eq d-1) . The 
larger fragment revealed two sequences, starting with Ala68 
and Vall52. The Lys67-Ala68 bond is positioned in the 

15 beginning of the second domain, while the cleavage of the 
ArglSl -Vall52 bond was not the result of the limited 
proteolysis (Fig. 11) . It was reported that isolated equist- 
atin is substantially nicked between ArglSl and Vall52, the 
chains are disulfide bonded and go apart only after reduc- 

20 tion (Lenarcic et al . (1997) J. Biol. Chem. 272: 13899- 
13903 ) . A narrow double band, visible on SDS PAGE , is most 
likely the result of the fragmentation very near to the C- 
terminus, meaning that this fragment represents the com- 
bined second and the third domain, eq d-2,3. According to 

25 the sequence data the three sequentially homologous parts 
of equistatin may have three potential proteinase binding 
sites. In previous work it was shown that the binding 
stoichiometry of equistatin and papain, as the representa- 
tive of cysteine proteinases, is 1:1 (Lenarcic et al . 

30 (1997) J. Biol. Chem . 272: 13899). When aspartic proteinas- 
es cathepsin D, was titrated with equistatin it was esti- 
mated that again 1 mol of equistatin was needed for the 
saturation of 1 mol of cathepsin D (Fig 12). This value was 
independent over a wide concentration range. In order to 

35 investigate the inhibitory activities of individual domains 
of equistatin a detailed kinetic analysis of the 
inactivation of papain and cathepsin D was performed. All 
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kinetic and equilibrium constants are given in Table 2. The 
first domain, eq d-1, exhibited practically the same 
inhibitory characteristics against papain as intact equist- 
atin, p41 fragment or ECI . This group of thyroglobulin 

5 type-1 domain inhibitors are considered as competitive, 
reversible and tight binding inhibitors of cysteine protei- 
nases, as papain, cathepsins B and L and cruzipain. The 
two-domain inhibitor, eq d-2,3, showed practically no 
inhibition of papain. 

10 The kinetics of binding of equistatin to cathepsin D 

was performed using a synthetic substrate which contains a 
chromophore, such as a nitrophenylalanine residue, in the 
PI' position. The assay sensitivity, afforded by H-Pro-Thr- 
Glu-Phe*Nph-Arg-Leu-OH as substrate, allowed us . to use 6.4 

15 nM concentration of the enzyme as a minimal concentration 
in the test. The obtained equilibrium dissociation constant 
- for the interaction between cathepsin D and equistatin (Ki 
= 0.3 nM) indicates that equistatin is a remarkably good 
inhibitor of aspartic proteinase, cathepsin D. For the 

20 papain active fragment, eq d-1, the Ki approx. ImM was 
determined. This value is several orders of magnitude 
higher than is the Ki value for the intact equistatin, 
indicating that the inhibitory active sice of the equista- 
tin must be located on other domains. The eq d-2,3 indeed 

25 exhibited practically the same inhibition characteristics 
as the whole equistatin {Ki > 0.6 nM) . Additionally, the 
formation of a tight complex between cathepsin D and 
equistatin was also visualized by a native PAGE (Fig. 10B) . 

Equistatin is the first protein inhibitor of cathepsin 
30 D with known primary structure of animal origin. Until now 
only derivatives of pepstatin were known to be as strong 
inhibitors of cathepsin D as is equistatin. 

The data provided in this study clearly show that 
different thyroglobulin type-1 domains present in equista- 
35 tin, despite their extensive amino acid sequence similar!- 
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ty, target different classes of proteinases, either cystei- 
ne or aspartic proteinase cathepsin D. 



Table 2. 

Kinetic constants for the interaction of equistatin, eq d-1 
and eq d-2 ,3 with papain and cathepsin D 



Enzyme 


Inhibitor 


1CT* x k a 
M' 1 s' 1 


10 4 x k d 
s- 1 


nM 


papain* 


equistatin 
eq d-1 
eq d-2 , 3 


12 ± 0.6 
1.8 ± 0.35 
ND d 


65 ± 1.5 
11 ± 0.3 
ND 


0.57 ± 0.04 
0.61 ± 0.01 
> 1000 e 


cathepsin D b 


equistatin 
eq d-1 
eq d-2 , 3 


ND 
ND 
ND 


ND 
ND 
ND 


0.3 ± 0.16 
> 1000 e 
0.4 ± 0.15 



Continuous rate assay was used for kinetic analysis of the 
interaction of papain with inhibitors. Ki was calculated from the 
ratio k d /k A . 

Data were determined from the inhibitory effect of inhibitor on 
the steady state, velocity for cathepsin D-catalysed hydrolysis 
of chromophoric substrate . 

Not determined . 

Neither eq d-2, 3 showed significant influence on papain nor 
eq d-1 on cathepsin D even at 5 mM; thus inhibition constant 
were estimated to be greater than 1 mM. 



Example 2 : 

Molecular cloning of equistatin 

From a single specimen of sea anemone Actinia equina L. , 
total RNA was isolated by disrupting the tissue in a liquid 
nitrogen. Two grams of ground, deeply frozen tissue was 
transfered into 20 ml of guanidinium thiocyanate solution 
(5.5 M GTC, 0.5 M EDTA, 0.2 M /? -mercapto - e thanol ) and 
homogenized in an electrical mixer (16,000 rpm, 4x 30 
sec). Subsequently, the solution was centrifuged at 
6000 X g , 20 min, at 15°C. 10 ml of clear supernatant was 
transfered to 10 ml of Cesium TFA solution (p = 1.5 nag/ml), 
supplemented with 2.5 ml of 0.5 M EDTA, pH=8.0. Sample was 
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centrifuged for 24h at 125,000 X g, 15=C. The supernatant 
was removed and total RNA (pellet) was dissolved in 1 ml of 
proteinase K solution (0.5 mg/ml) . After incubating the 
solution at 50°C, l/2h, total RNA was precipitated by 1/10 . 
5 volume of 3M KOAc , pH=5.2 and 2.5 vol. of 96% ethanol and 
the mixture was placed overnight in the freezer at ~20°C. 
Total RNA was pelleted at 8000xg and the pellet was dissol- 
ved in 2 ml of TE buffer. 1 ml of dissolved sample was 
heated at 65°C for 5 min, cooled on ice and subsequently, 

10 0,2 ml of TE buffer, supplemented with 3M NaCl was added. 
Whole sample was applied to the top of the oligo (dT) - 
cellulose bed in the column. The cellulose was washed three 
times with the TE buffer, supplemented with 0.5 M NaCl and 
0.1 M NaCl, respectively. Poly (A) + RNA was eluted with 1 ml 

15 (divided into 4 aliquots of 0.25 ml) of TE buffer, prewar- 
med to 65°C. One third of the sample (approx. 1 pg) was 
used for a synthesis of cDNA according to manufacturer's 
procedure (Amersham) . First strand cDNA synthesis was 
performed using 11 pi of mRNA solution, 4 pi 5 X reaction 

20 buffer, 1 pi Na-pyrophosphate solution, 1 pi human placen- 
tal ribonuclease inhibitor (5U/pl) , 2 pi of dNTP mix 
solution and oligo dT primer solution (1 pi) . After additi- 
on of 2 pi (10 U/pl) of reverse transcriptase, a mixture 
was incubated at 42°C for 40 minutes. Second strand cDNA 

25 was synthesized by adding the following components: 37.5 pi 
of second strand reaction buffer, 7 pi of E. coli DMA 
polymerase I (4U/pl) and 1 pi of E. coli ribonuclease H (1 
U/pl) . The reaction mix was incubated sequentially at 12°C 
for 60 minutes and 22°C for 60 minutes. After heat denatu- 

30 ration (5 minutes at 70°C) , 0.5 pi of T4 DNA polymerase 
(2U/pl) was added and the reaction mix was incubated at 
37°C for 10 minutes. 2.5 pi {250 pmoles) of EcoRI -adaptors 
were ligated to 1 pg of cDNA, using 2 pi of T4 DNA ligase 
(4U/pl) in 20 pi of ligation mixture. After Sh at 16°C, the 

35 ligation mixture was subjected to column purification/size 
fractionation of adaptor- linked cDNA, using spun columns 
and TE buffer. Collected fractions of purified cDNA were 
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phosphorylated by T4 polynucleotide kinase (BU/fil) a-nd cDNA 
was ligated into dephosphorylated Xgtll bacteriophage arms 
using T4 DNA ligase (40U/^g DNA) . Finally, whole ligation 
mixture was in vitro packaged using packaging extract from 
5 the same manufacturer (Amersham) . A portion of Xgtll cDNA 
library (10 s pfu) was used for the infection of Y1090 .E . 
coli cells and mixture was plated onto LB agar plates. 
Plaques were blotted onto nitrocellulose membranes. Membra- 
nes were rinsed in Tris-buf f ered saline with 0.01% Tween-20 

10 (TBST) 3 times and subsequently in blocking solution (20% 
(v/v) fetal serum in TBST) . After washing the membranes in 
TBST, the first antibody (rabbit anti equistatin IgG) in 
TBST was added and membranes were treated in solution 
overnight at 4°C. Afterthat, membranes were washed three 

15 times with TBST, and treated with second antibody (goat 
anti rabbit IgG. -horse raddish peroxidase). After final 
washing in TBST, a visualization with diaminobenzidine as a 
substrate was performed . Three positive clones were isola- 
ted from agar plates and after re-plating, phages were 

20 eluted from the surface of the agar plates with TE buffer 
(5 ml per* plate) . Phage DNA was isolated using Wizard 
Lambda Preps DNA isolation kit (Promega) according to the 
manufacturer's procedure. After restriction analysis v/ith 2 
fil of EcoRI restriction enzyme (10U//il) per 3 ^g of XDNA 

25 and size fractionation on 1% agarose gel, cDNA inserts were 
excised, purified with glass milk and subcloned into EcoRI 
cloning site of pUC19 plasmid. Whole ligation mixture (10 
til of each sample) was transformed into DH5c* E . coli cells 
by incubating the 100 ul cf highly competent cells (O.D. SS0 = 

30 0.6) and 10 /il of ligation mixture in a water bath (42°C) 
for 45 seconds. After addition of LB medium (900 fil) and lh 
incubation (37°C # 250 rpm) , bacterial mixture was plated 
onto LBA plates, supplemented with X-gal and I PTG and after 
overnight incubation (37°C) , white colonies were transfer- 

35 red into 5 ml of LB medium and incubated for an additional 
16 hours (37°C, 250 rpm) . Plasmids were isolated using 
Wizard Plasmid Purification System (Promega) according to 
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manufacturer's instructions and analysed by nucleotide 
sequencing using T? DNA polymerase (T7 sequencing .kit, 
Pharmacia) and ( 35 S]dATPaS (Amersham) . Sequencing of 
selected cDNA clones resulted in the full length cDNA clone 
5 given in Figure 1 

Example 3 : 

Expression of recombinant equistatin cDNA in Escherichia 
coli 

Two primers were used to amplify the mature protein of 
10 equistatin and to clone it as an NcoI-NotI fragment behind 
the g3 signal peptide present in the E.coli expression 
vector pB3 . Primer 1 was PDEI-1: CGC GCC ATG GCG AGT CTA 
ACC AAA TGC CAA and primer 2 was PDEI-2: GGG TGC GGC CGC 
GCA TGT GGG GCG TTT AAA. Correct inserts were sequenced to 
15 check for sequence errors and one clone was selected for 
obtaining recombinant protein. 

Recombinant equistatin was obtained by growing a single 
colony of E. coli strain HB2151 with the plasmid pB3 , 
carrying the equistatin cDNA, overnighc in 5 ml of LB 
20 growth medium with ampicillin (100 mg/1, final cone.) at 3 7 
°C, 250 rpm. 5 ml of the overnight culture was used for the 
inoculation of 800 ml of LB medium with ampicillin (100 
mg/1, final cone.) in a 2 1 flask. Cells are grown with 
shaking (30 °C at 250 rpm) to O.D.600 = 0.5. After that, an 
25 IPTG stock solution is add to a final cone, of ImM and 
growth of cells is continued using the same conditions as 
above for an additional 6h. Cells are placed on ice for lh, 
then pelleted at 4 OOOXg for 10 min ac 4 C C and resuspended 
in 50 ml of ice cold 10 mM MgS04 . Suspension is placed at - 
30 20 °C untill the liquid is completely frozed and afterthac, 
content is thawed by submerged the flask with the sample 
into a water bath (30 °C) . Immediatelly after re-thawing, 
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cells are removed by centrif ugation at 6000Xg for 1Q min at 

4 °C and supernatant is stored at -20 °C for subsequent 
affinity purification on papain-sepharose . 

Example 4 : 

Purification and characterization of recombinant equistatin 



A. Affinity purification using papain-sepharose: 

10 ml of a papain-sepharose slurry is mixed with a 15 ml 
of 0.02 M NaOH for 10 min. Subsequently, the slurry is 
applied on a 15 ml column with glass fritte. The column 

10 was washed 3 x with 30 ml of 100 mM Tris buffer, pH = 7 . 0 
and finally, with 10 ml of 50 mM MES buffer, pH = 6.5 
(with the addition of cysteine to a final cone. 0.6 mg/ml - 
) • Supernatant from 1 1 of bacterial culture, obtained as 
described above (example 3) is dropwise applied on a 

15 column. Papain-sepharose is then washed with 20 ml of 50 
mM MES buffer, pH=6 . 5 (without cysteine) and with 50 ml of 
20 mM Tris buffer, pH=7.5 . Sample is recovered by eluting 
the column with the 20 ml of 20 mM Tris buffer, pH=10.3 
(without adjusting the pH with HC1), 20 % DMSO . Purified 

20 equistatin was dialyzed against H 2 0 and concentrated using 
Sartricon mini -concentrators in order to achieve a final 
concentration of 350 /xM . 



B. Stoichiometry and inhibition constants for recombinant 
equistatin 

In a microtiterplate 20 /il papain solution (a fresh soluti- 
on of l mg/ml (Sigma) in MES buffer titrated with E-64 to 
determine the active fraction, usually 17%) is combined 
with 0-80 pi of known protein concentration. MES buffer (50 
mM MES, pH 6.5; 0.6 mg/ml L-cysteine; 1 mg/ml BSA fraction 
V) is added to a final volume of 150 pi. The mixture is 
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incubated for 30 min at room temperature and subsequently 
50 jil of substrate solution (60 jzl of 15 mg/ml Z-Phe-Arg- 
pNA dissolved in methanol is diluted in in 940 jxl MES 
buffer before use) . The plate is immediately placed in the 
microti terplate reader and measured at 405 nm. Readings up 
to an OD600 of 0.3 are linear. Rates of change are used to 
determine the activity with increasing amounts of inhibi- 
tor. Results are graphically represented and at stoichiome- 
tric concentrations the amount of dissociated complex is 
determined to determine the apparent equilibrium dissocia- 
tion constant (Ki) . These results established that one 
molecule of equistatin will inhibit approximately 1 molecu- 
le of papain. The apparent equilibrium dissociation con- 
stant for papain was estimated to be 0.6 nM in agreement 
with the data published for the purified protein (Lenarcic 
et al. (1997) J. Biol. Chem. 272: 13899-13903. 



Example 5: 

In vitro inhibition of Colorado potato beetle midgut 
protease activity 

Guts of final instar Colorado potato beetle larvae reared 
on methyl jasmonate induced plants were isolated and 
extracted essentially as described (Bolter and Jongsma, J*. 
Insect Physiol. 41: 1071-1078). In these gut extracts the 
proteases that are sensitive to the protease inhibitors of 
potato are already complexed. The residual protease activi- 
ty is composed of potato protease - inhibi tor- insensitive 
proteases which were induced in response to the methyl 
jasmonate induced potato inhibitors. These proteases are 
the proteases that render the beetle larvae insensitive to 
protease inhibitor defense of the potaco plant. We tested a 
broad range of different cysteine protease inhibitors for 
activity specifically against these induced potato protease 
inhibitor-insensitive cysteine proteases. Nearly all of 
these inhibitors were purified at the Jozef Stefan Institu- 
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te in Slovenia. Testing their potential against Colorado 
potato beetle was done using both general protein substra- 
tes (azocasein, table 2) as well as specific synthetic 
substrates (L-Arg-pNA, table 3; pGlu-Phe-Leu-pNA, table 4; 
5 Z-Phe-Arg-pNA, table 5; Z-Arg-Arg -pNA, table 6). In speci- 
fic cases inhibitors were tested at two concentrations, 
equimolar and in excess to the protease, in order to obtain 
an indication of the tightness of the complex. Most of the 
tested inhibitors were either inactive or only weakly 

10 inhibitory. Only the type I repeated thyroglobulin domain 
cysteine protease inhibitors that were tested (purified p41 
invariant chain fragment and purified equistatin) were 
consistently highly active against the endo- and exo- 
proteolytic activity of Colorado potato beetle larvae 

15 assayed (tables 2-6; figure 4). Importantly, this class of 
inhibitors was capable of inhibiting nearly all general 
cysteine protease activity. The equistatin peptide molecule 
and a functional derivative thereof were not very active 
against the aminopeptidase-like cysteine protease activity. 

20 Recombinant human stefin A was highly active against this 
type of activity. The best combinacion of inhibitors for 
full toxicity against the Colorado pocato beetle would 
therefore be a combination of equistatin or p41 invariant 
chain fragment and stef in-like inhibitors. 
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Effect of different cysteine protease inhibitors on general 
proteolytic activity of Colorado potato beetle larval gut 
extracts as measured with azocasein 



Family of cysteine 
protease inhibitor 


name 


Residual activity in 
excess of inhibitor 


Type I cystatins 


human stefin A 


90% 


rat stefin A 


90% 


porcine stefin B 


100% 


porcine stefin Dl 


105% 


porcine stefin D2 


105% 


Type II cystatins 


chicken cystatin 


105% 


human cystatin C 


105% 


Type III cystatins 


bovine Jcininogen 


110% 


human LMW kininogen 


65% 


hum kininogen 3rd domain 


20% 


Phytocys t at ins 


Oi&l iHnniiim ma t m c 

wliCJ. Ji LU i U. LJ-lll Wet j U J 

cystatin 


J. \JU 9 


cowpea cystatin 


75% 


lens cystatin 


60% 


soybean cystatin 


65% 


potato multicystatin 


90% 


bromelain inhibitor 


110% 


Type I domain of 
thyroglobulin 


p41 invariant chain 
fragment 


10% 


equistatin 


10% 


Plant Kunitz CPI 


PCPI 6.6 


120% 
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Table 4. 

Effect of different cysteine protease inhibitors on amino- 
peptidase activity as measured with L-Arg-pNA 





% of residual activity 


excess inhibitor 


equimolar 


equi statin and p41 invariant chain 
fragment 


75% 


85% 


human stefin A 


10% 


60% 


kininogens 


75% 


n.d. 


phytocystatins from Fabaceae 


75% 


n.d. 



Table 5. 

10 Effect of different cysteine protease inhibitors on speci- 
fic tri-peptidil-peptidase and endoprotease activity as 
measured with pGlu-Phe-Leu-pNA 





% of residual activity 


excess inhibitor 


equimolar 


equistatin and p41 invariant chain 
fragment 


-5% 


10% 


human stefin A 


90% 


95% 


kininogens 


20% 


n.d. 


phytocystatins from Fabaceae 


70% 


n.d. 
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Effect of different cysteine protease inhibitors on broad 
spectrum endoprotease activity as measured with Z-Phe-Arg- 
pNA. 





% of residual activity 


excess inhibitor 


equimolar 


equistatin and p41 invariant chain 
fragment 


20% 


30% 


human stefin A 


80% 


95% 


kininogens 


-20% 


n.d. 


phytocystatins from Fabaceae 


50% 


65% 



Table 7. 

Effect of different cysteine protease inhibitors on narrow 
spectrum endoprotease activity as measured with Z-Arg-Arg- 
pNA 





% of residual activitv 


excess inhibitor 


equimolar 


equistatin and p41 invariant chain 
fragment 


-5% 


20% 


human stefin A 


90% 


n.d. 


kininogens 


10% 


n.d. 
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Example 6 : 

In vitro inhibition of protease activity of adult western 
flower thrips, and leafminer flies, and final instar larvae 
of Colorado potato beetle and western corn rootworm as 
5 measured with FITC- labeled hemoglobin. 

Adult western flower thrips (Frankliniella occidenta- 
lis) and leafminer flies (Liriomyza trifolii) adult were 
harvested from a culture maintained on chrysanthemum plants 
and complete thrips and flies were homogenized in extracti- 
ng on buffer (200 mM /J-alanine-HCl , pH 3.5) in a volume of 5 
times the weight of the insects. The buffer pH was at the 
previously determined pH optimum of protease activity 
towards hemoglobin. Final instar Colorado potato beetle 
larvae maintained on potato plants as described in example 
15 5 were checked for gut aspartic protease activity by 
preparing a total gut extract in pH 3 buffer which is 
optimal for Colorado potato beetle aspartic proteases (200 
mM glycine, pH 3). Guts were homogenized in 100 fil buffer 
per gut. Third instar western corn rootworm larvae maintai- 
20 ned on corn roots were used to remove the guts . Ten guts 
were homogenized in 100 fil water and spinned twice to 
remove insoluble material. Two types of buffers were used 
in the enzymatic assay. One with a pH presumably favoring 
cysteine proteases (50 mM MES, pH 6.5; 0.6 mg/ml L-cystei- 
25 ne) and one for detecting aspartic proteases (200 mM 
glycine, pH 3). Supernatants were stored at -20 C. 

2 jil gut extract was combined with 2 inhibitor 
(2 mM pepstatin in methanol, 4 mM E64 in wacer, 2 mg/ml 
recombinant equistatin in water. The concentration of the 
30 other proteinaceous inhibitors was not known exactly) . 
Appropriate buffers were added to a final volume of 100 t±l . 
After 15' preincubation 20 /il substrate (5 mg/ml FITC- 
hemoglobin) was added and incubated for 30-4 5 min at 37 C. 
Reaction was stopped by the addition of 100 ^1 10% TCA . 
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Tubes were centrifuged and 100 pi supernatant mixed with 
100 fj.1 10 N NaOH was measured on a fluorimeter for the 
extent of hemoglobin hydrolysis. Measurements were done in 
duplicate on one (thrips, leaf miner and western corn 
rootworm) , or three (Colorado potato beetle) different gut 
extracts and varied by a maximum of +/- 5%. 

The effects of different cysteine and aspartate 
protease inhibitors are listed in table 9. They provide the 
effects of different protease inhibitors (Pis) against U PI- 
insensitive proteases" of thrips adults, and leaf miner 
flies on chrysanthemum , of Colorado potato beetle on potato 
and corn rootworm on corn because induced Pis present in 
the plantmaterial and ingested by the insect will be 
present in the extract in complex with the susceptible 
proteases . 

Thrips protease activity can be 92% inhibited by 
E-64 (cysteine PI) and 16% by pepstatin (aspartic PI) at pH 
3.5 which is optimal for thrips general protease activity. 
Apparently, aspartic proteases are not not dominant in this 
insect. P41-invariant chain resulted in 87% inhibition 
whereas equistatin afforded 95% inhibition of protease 
activity. Clearly both P41-invariant chain (cysteine PI) 
and equistatin (cysteine/aspartic PI) are both good inhibi- 
tors of thrips cysteine proteases, though equistatin may be 
slightly better due to the additional inhibition of* aspar- 
tic proteases. 

Leafminer proteases can only be fully inhibited by 
a combination of E64 (cysteine PI) and pepstatin (aspartic 
PI) (97%). Potato cystatin and Kunitz PCPI8.3 are both 
cysteine protease inhibitors which are capable of inhibi- 
ting 63% comparable to 73% by E64 . Addition of equistatin 
to those two inhibitors results in 92% inhibition, demon- 
strating that equistatin must have leafminer aspartic 
protease inhibitor activity apart from cysteine protease 
inhibitor activity. For optimal control of leafminer a 
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combined use of equistatin with potato cystatin and Kunitz 
PCPI8.3 may be necessary. 

Colorado potato beetle proteases at pH3 can only 
be fully inhibited (97%) by a combination of E64 (24%) and 
peps tat in (82%) . Addition of equistatin to either E64 or 
pepstatin increases the inhibition by 42% (24 %+42%=66% ) . 
and 12% ( 82%+12%=94%) respectively demonstrating that 
equistatin inhibits more than 50% of both aspartic and 
cysteine protease activity at this pH. equistatin alone 
inhibits 62% of total protease activity at this pH. Appa- 
rently, the partial inhibition at pH 3 combined with nearly- 
full inhibition at pH 6.5 (example 5) is sufficient for 
full control of this insect (example 7) . 

Western corn rootworm is known to possess a 
complement of both cysteine and aspartic proteases (Gilli- 
kin et al. (1992) Arch. Insect Biochem, Physiol . 19: 285- 
298) . The effects of equistatin, E64 and pepstatin were 
tested at two different pH values. The data in table 9 show 
that equistatin almost completely inhibited all cysteine 
and aspartic protease activity (93% at pH 6.5 and 98% at pH 
3) and was even more powerful than the combination of E64 
and pepstatin (79% and 89% resp . ) . These in vitro results 
are even better than the in vitro results for Colorado 
potato beetle in examples 5 and 6 and indicate that equis- 
tatin can be expected to be toxic towards western corn 
rootworm when expressed in corn roots. 
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In vitro inhibition assays measuring residual protease 
activity in extracts of different insects. 



inhibitors 


thrips 
pH 3 .5 


leaf - 
miner 

pH 3.5 


colo - 
rado 
potato 
beetle 

pH 3 


we s tern 
corn 
root - 

worm 

nil (Z C 


western 

corn 

root- 

worm 


control 


100 % 


100 % 


100 % 


100 % 


100 % 


E64 


8 % 


27 % 


76 % 


51 % 


35 % 


E64/equistatin (EI) 


n.d. 


n.d. 


34 % 


29 % 


8 % 


Pepstatin 


84 % 


46 % 


13 % 


58 % 


45 % 


Pepstatin/EI 


n.d. 


n.d. 


6 % 


6 % 


0 % 


E64/Pepstatin 


n.d. 


3 % 


3 % 


21 % 


11 % 


EI 


5 % 


24 % 


38 % 


7 % 


2 % 


p41 invariant chain 


13 % 


43 % 


n.d. 


n.d. 


n.d. 


p . cystatin 


n.d. 


73 % 


n.d. 


n.d. 


n.d. 


PCPI8 . 3 


n.d. 


43 % 


n.d. 


n.d. 


n.d. 


API 


n.d. 


37 % 


n.d. 


n.d. 


n.d. 


p . cystatin/?CPI8 . 3 


n.d. 


8 % 


n.d. 


n.d. 


n.d. 


p.cystatin/PCPI8 .3/API 


n.d. 


7 % 


n.d. 


n.d. 


n.d. 


p.cystatin/PCPI8 .3/EI 


n.d. 


n.d. 


n.d. 


n.d. 


n.d. 


bean cystatin 


14 % 


n.d. 


n.d. 


n.d. 


n.d. 



- Recombinant Kunitz PCPI8.3 (Stiekema et al . (1937) Plant 
Molec, Biol . 11: 255-269) was produced in the yeast Pichia. 
pastoris and purified from culture supernatant by cation 
exchange chromatography . 

Recombinant potato cystatin (p. cystatin) represents a 
monomer of mult icystat in. cloned by RT-PCR from potato cv . 
Superior and expressed and purified as a fusion protein 
with glutathione-S - transferase (Pharmacia) . 

- Equistatin was either purified from sea anemone (Lenarcic 
et ai.(1997) J. Biol. Chem. 272; 13899; Lenarcic et al . 
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Biol . Chem . 273: 12682) (thrips and leaf miner assays) or 
recombinant from E.coli (Colorado potato beetle and Western 
corn rootworm assays) 

Aspartic protease inhibitor (API) was purified from 
potato (Kreft et al . (1997) Phvtochemistrv 44: 1001-1006) 



Example 7: 

Toxicity of ecfuistatin towards Colorado potato beetle 
larvae 

10 Potato tubers of cultivar Surprise {Solanum tuberosum) were 
sprouted. Sprouted tubers were planted in 1 1 pots and 
grown for 3-4 weeks at a 22/18 °C, 16/8 hr day night rhyth- 
m. Plants 10-15 cm high were placed in glass jars together 
v/ith a paper wick on which 2 ul methyl j asmonate was 

15 pipeted. Jars were sealed immediately with parafilm and 
placed in a climate chamber of 3 0°C with continuous light . 
Control plants were placed in a chamber of 25 °C with a 16-8 
hr day night rhythm. After one day at 3 0 °C plants were 
taken from the jars and placed in the same chamber as the 

20 control. Plants were used on day 3. Freshly treated plants 
were used for each subsequent day of feeding. This treat- 
ment resulted in high endogenous PI levels in the methyl - 
jasmonate treated plants. The top meristems were removed 
from the plants and painted on both sides with a 175 /iM 

25 solution of recombinant equistatin in 0.3% agar obtained by 
mixing 1:1 a stock solution of 350 /zM equistatin with 0.6% 
water agar. Controls were painted with 0.3% water agar. The 
agar solution was applied at a concentration of 30 ^l/cm2 . 
The final concentration on the leaf was estimated to be 70 

30 /iM which is equivalent to 1.4 mg/g leaf. Painted leaves 
were placed in a tube containing 0.4 % agar and put on top 
of a filterpaper inside a petridish. 21-26 newly hatched 
Colorado potato beetle larvae were placed on the leaves and 
the petridish was put in an incubator set at 23 °C . Every - 

35 day fresh painted leaves replaced the old ones. 
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In tables 9 and 10 the effects on growth and mortality of 
the Colorado potato beetle -larvae are summarized. It is 
apparent from these tables that equistatin applied on 
control plants with low levels of endogenous protease 

5 inhibitors is capable of severely reducing development and 
causing high mortality rates on the larvae already after.- 4 
days. Applying equistatin on leaves containing high endoge- 
nous PI levels induced by prior treatment with methyl 
jasmonate further enhances the toxic effects of this 

10 inhibitor, however. This confirms the expected synergistic 
effect of this inhibitor because it specifically targets 
the "Pi-insensitive proteases" of Colorado potato beetle 
larvae . 



Table 9. 

15 Effect of recombinant equistatin on growth of Colorado 
potato beetle larvae 



treatment 


day 1 


day 2 


day 3 


day 4 


control 


n.d. 


n.d. 


9 


13 .8 


control + equistatin 


n.d. 


n.d. 


0 . 5 


0 . 7 


MeJa-control 


n.d. 


n.d. 


7 . 5 


10 


MeJa-control + equistatin 


n.d. 


n.d. 


n.g . 


n.g. 



21-26 larvae per experiment were assayed 
Larval weights are given in mg/larvae; 
n.d. is not determined; 
25 n.g. is not grown or dead 
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Table 10, 

Effect of recombinant equistat in on percent mortality of 
Colorado potato beetle larvae 



treatment 


day 1 


day 2 


day 3 


day 4 


control 


0 


0 


0 


0 


control + equistatin 


0 


10 


52 


75 


MeJa-control 


0 


0 


0 


0 


MeJa-control + equistatin 


0 


23 


77 


92 



21-26 larvae per experiment were assayed 



10 Example 8: 

In vivo effect of equistatin on oviposition rate of thrips 

A sample of 14 2 pM recombinant equistatin purified as 
described in example 4 was assayed for activity towards 
thrips. Sample was acidified with HC1 to pH 3 to stabilize 

15 the equistatin protein. Controls contained acidified water 
or 2.5 mg/ml BSA dissolved in acidified water. The ovipo- 
sition rate of thrips females was assayed using socalled 
Murai cages. Briefly, perspex tubes closed on side with a 
fine gauze were inoculated with 10 females and bee pollen. 

20 Tubes were closed with parafilm and 300 ul fluid was placed 
on top of the parafilm. A second layer of parafilm enclosed 
the fluid. Pollen and sample fluid were replaced every day 
for two days. Eggs deposited in the liquid sample were 
counted on day 2 . 
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Oviposition rate of adult females two days after being 
placed on different diets 



diet 


eggs per female per day 


relative percentage 


BSA 


1.8 


100 % 


water 


1.5 


83 % 


equistatin 


0.3 


17 % 



Example 9 : 

Modification of the equistatin gene for improved expression 
10 in plants 

The equistatin cDNA contains in the coding region several 
potential plant polyadenylat ion signals, mRNA instability 
motifs and a suboptimal codon usage for expression in 
plants. To improve the level of gene expression in plants 

15 these motifs may be removed and codons may be optimized by 
site specific mutagenesis without altering the primary 
protein sequence. Below an example of the modifications 
required to obtain improved gene expression in potato are 
given. The top strand represents the coding part of the 

20 cDNA clone, below that the suggested modif icat icns of the 
cDNA sequence are given and below tha^t the protein coding 
sequence is given using the one-letter code for the amino 
acid residues. 

1 ATGGCTCTTAGCCAA^CCAAGCCAAGTrrTCCPAAGGATTCGTCGTGATGATTTGG 
25 G G G 

-32 M A L S Q N 0 A K F S K G ? V V M I W 
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SO GT ACTATTCATTGCTTGTG CT AT AACTTCAACTGAAGCT AGTCT AAC C AAATGC C AACAG 

C G 
-13 VLFIACAITSTEAS LTKCQQ 

-1 +1 

5 12 0 CTCCAGGCCTCGGCTAACAGTGGTCTGATAGGTACTTATGTACCACAATGCAAAGAAACG 
G T T T 

3 LQASANS GLIGTYV PQCKET 



180 GGAGAGTTCGAAGAAAAACAATGCTGGGGATCGACTGGTTACTGTTGGTGTGTGGATGAA 
T G T 

10 23 G E F E E KQ C WGSTG Y CWCVDE 



2 4 0 GATGGAAAAGAGATTCTAGGAACCAAGATCCGTGGATCTCCGGATTGCAGCCGCAGAAAA 

T A ACT 
48 DGKEI LGTKIRGS PDCSRRK 



3 0 0 GCCGCGTTAACACTTTGCCAGATGATGCAAGCCATCATTGTTAATGTCCCTGGTTGGTGT 
15 T C G 

68 AALTLCQMMQAI I V NVPGWC 



3 6 0 GGCCCTCCATCGTGTAAAGCTGACGGCAGTTTTG ACGAGGTTC AGTGCTGCGC A^.GTAAT 
A A 

88 GPPSCKADGSFDEVQCCASN 



20 4 2 0 GGAGAATGCTACTGTGTGGAT AAGAAAGG A&AAGAACTTG AAGGC ACAAGAC AAC AGGGA 



108 GECYC V D KKGK E L £ G T R QQG 
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430 AGGCCAACCTGCGAAAGACACCTAAGCGAATGCGAGGAAGCTCGAATC AAGGCGCATTCA 

G T A 

12 8 RPTCEP. HLSECEEARI KAHS 



54 0 AACAGTCTTCGTGTTGAGATGTTCGTGCCAGAGTGTTT AGAAGATGGATC ATAT AACCCA 
5 T C T 

14 3 NSLRVEMFVPECLEDGSYNP 



600 GTAC AGTG CTGGCCTAGCAC AGGATACTGTTGGTGCGTCG ATG AAGG AGGGGT AAAGGTA 

T 

163 VQC WPSTGYCWCVDEGGVKV 



10 660 CCAGGTTCCGATGTCAGATTTAAACGCCCCACATGCTAA 

C C T 

188 PGS DVRFKRPTC 

199 



Example 10 : 

15 Construction of plant vectors for expression of equistatin. 

The potato Cab promoter (Map et al . (1993) Plant Mol . Biol . 
23: 605-612) was amplified from plasmid pPPG by PCR using 
the primers Pl-POTCAB and P2-POTCAB. Similarly the Nos 
terminator was amplified from plasmid pPPG using the 

20 primers NOS -TERM-DN and NOS -TERM -UP. The promoter and 
terminator fragment were cut with the restriction enzymes 
&2EcoRI and SacI and ligated into an EcoRI digested pUCAP 
vector (Van Engelen et al . (1995) Transgenic Research 4: 
288-290). A correct clone was selected and sequenced. This 

25 clone, pUCCABl was digested with Ncol and Bglll and used to 
subclone the equistatin coding region which was amplified 
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by PCR from plasmid pB3 -equistatin using the primers 
EQUISTAT-DN and EQUISTAT-BGL and also cut with Ncol and 
3glII, A correct clone was selected and the insert equista- 
tin cDISFA clone was sequenced. A correct clone, pUCCABl- 
equistatin, was digested with EcoRI . The EcoRI fragment 
containing the equistatin expression cassette was ligated 
into the plant vector pBINPLUS (Van Engelen et al . (1995) 
Transgenic Research 4: 288-290) which was also digested 
v/ith EcoRI. A correct clone was selected. This clone, 
pCABl -equistatin, was electroporated to electrocompetenc 
Agrobai cterium tumefaciens AGL-0 cells. Positive clones were 
selected on LB-medium containing 100 mg/1 kanamycin. 



Table 12. 



PCR-primers 


used 


for PCR- amplication 


name 


DNA 


Pl-POTCAB: 


5' 


-GGGGGGGAATTCCTGACCTCTTACTAACTCG 


P2 -POTCAB : 


5 ' 


-GGGGGGGAGCTCAGATCTTGCCATGGTTTTTCTTCTCTTTTTTTTTG 


NOS-TERM-DN: 


5 ' 


-AGATCTGAGCTCTCGTTCAAACATTTGGCA 


NOS -TERM-UP: 


5 ' 


- AAGCTTGAATTCG ATCTAGTAACAT AG 


EQUISTAT-DN: 


5' 


- GGGGCCATGGCTCTT AGCCAAAAC 


EQUISTAT-BGL: 


S 


-GGGGGAGATCTTTAGCATGTGGGGCGTTTAAIV 



Example 11: 

Transformation of potato with plant vectors containing the 
equistatin cDNA 

On day 1 an Agrobac cerium tumefaciens culture of AGL0 
containing the pCABl -equistatin binary vector was started 
in 50 ml LB-medium containing 50 mg/1 kanamycm and shaken 
for 2 days at 28 °C . On day 2 internodes from an in vitro 
culture of the potato cultivar Desiree line V were cue into 
0.5-i cm pieces and placed on R3B mediura (30 g/i sucrose, 
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4.7 g/1 Murashige and Skoog salts, pH 5.8 (KOH) 8 g/1 
purified agar, 2 mg/1 NAA and 1 mg/1 BAP) which was covered 
with 2 sterile filterpapers that had previously been soaked 
in 2 mi PACM medium (30 g/1 sucrose, 4.7 g/1 Murashige and 
Skoog salts, 2 g/1 casein hydrolysate, pH 6 . 5 (KOK) , 1 mg/1 
2,4-D and 0.5 mg/1 kinetine) . The dishes were taped with 
parafilm and incubated overnight at 24 °C under a regime of 
16 h light. At day 3 the A . tumefaciens culture was poured 
in a sterile petridish containing the explants. After 5-10 
min explants are removed from the culture, placed on a 
sterile filter paper to remove excess Agrobacteria and 
placed back on the R3B medium containing dishes after first 
removing the top filter paper (leaving one behind) . Dishes 
with explants were further incubated at 24 °C and 16 h 
light until day 5, when the explants were transferred to 
dishes containing ZCVK medium (20 g/1 sucrose, 4.7 g/1 
Murashige and Skoog salts, pH 5.8 (KOH), 8 g/i purified 
agar, 1 mg/1 zeatine, 200 mg/1 vancomycin, 100 mg/1 kanamy- 
cin, 200 mg/1 claforan) . On day 19 and subsequently every 
3-4 weeks explants were transferred to new ZCVK medium. 
When shoots appeared shoots were transferred to Murashige 
and Skoog medium containing. 20% sucrose (MS20) . After 
rooting plants were transferred to the green house. 

Example 12 : 

Bioassays of Colorado potato beetle larvae on transgenic 
potato plants expressing equistatin 

The equistatin cDNA sequence optimized for expression in 
potato plants was cloned into the pCABl vector and trans- 
formed to line V. Eight different primary t ransf ormant s 
were tested for resistance to newly hatched Colorado potato 
beetle larvae. Leaves were removed from young plants in the 
greenhouse and inserted into a tube containing 0.4% puri- 
fied water agar and placed in a petridish with filter 
paper. Six randomly picked newly hatched larvae were 
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inoculated per leaf. Leaves were replaced after two days 
with fresh leaves. On day 3 larval weights were measured 
for each larva individually. Table 13 provides the results, 
indicating that 3 out of 6 trans formants significantly 
5 retarded growth of the larvae. Some plants lack resistance 
most likely due to low expression caused by a suboptimal 
position of the T-DNA insertion in the plant genome. Plants 
were too young to extend the experiment for longer out of 
lack of leaf material, but it was observed that the larvae 
10 on pCAB-EIM-1 were all dead on day 4. The presence of the 
equistatin protein was confirmed by western blotting and 
estimated to be >0.1% in transgenics which showed resis- 
tance . 



Table 13. 



Results of bioassay on transgenic potato plants transformed 
with the equistatin gene optimized for expression in 
plants . 



Plant* 


Larval weight 6 


Line V 


9.93 a 


pBINPLUS 


10.67 a 


pCABl -EIM-1 


4 .03 b 


pCABl-EIM-2 


5.45 b 


pCABl-EIM-3 


8 .77 a 


pCABl-EIM-6 


8.92 a 


pCABl-EIM-7 


8.55 a 


pCABl -EIM-8 


5.85 b 


pCABl-EIM-9 


9.18a 


pCABl-EIM-10 


11.15 a 



a Plants tested were Line V, an in vitro plant transferred 
to the greenhouse simultaneously with the transf ormant s ; 
pBINPLUS, a line V transf ormant with the empty vector 
without the promoter-gene cassette; pCABl -10, the first 8 
line V transf ormants with the optimized eauistatin gene 
under the control of the CAB promoter. 
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b Average larval weights (mg) of six larvae. The * letter 
code following the weights of the larvae indicates signifi- 
cance as determined by ANOVA . 

Example 13 : 

Isolation of homologous gene sequences from other organisms 
in order to find or generate improved inhibitors. 

The 6 amino acid residues Gly-Tyr-Cys-Trp-Cys-Val which are 
strongly conserved among type I repeated thyroglobulin 
cysteine and aspartic protease inhibitors, whether from 
human, salmon or sea anemone sources, may be used to 
isolate homologous sequences with improved specificities. 
Degenerate PCR primers may be designed based on these 
sequences to amplify genomic or cDNA fragments which can be 
used as probes to isolate the entire coding sequence from 
for example cDNA libraries or by 5 ' RACE experiments from 
purified mRNA. Any organism including insects and plants 
may be used as new sources of type I repeated thyroglobulin 
domains. Collections of genes may be used in gene shuffling 
experiments to isolate new specificities. 



WO 98/58068 PCT/NL98/00352 

- 64 - 

Claims 



1 . A method of protecting a plant or a part of said 
plant against insect or nematode infestation by one or more 
insects or nematodes having digestive cysteine proteases, 
5 comprising presenting to a locus wherein said insect (s) or 
nematode (s) is (are) to be controlled an inhibitory amount 
of a cysteine protease inhibitor selected from the group of 
proteins containing at least one type I repeated thyroglo- 
bulin domain. 

10 2. The method of claim 1, wherein the insects have 

cysteine proteases that are insensitive to host plant 
derived cysteine protease inhibitors . 

3 . The method of claims 1 or 2 wherein the insects are 
one or more of Colorado potato beetle, corn rootworm, 
15 thrips and leaf miner. 

4. The method of claim 1, wherein the nematodes are 
cyst nematodes or root knot nematodes . 

5 . The method of any claims 1 to 4 , wherein the 
protein containing at least one type I repeated thyroglobu- 

20 lin domain is human p41 invariant chain fragment or a 
homologue or functional derivative thereof. 

6 . The method of any of claims 1 to 4 , wherein the 
protein containing at least one type I repeated thyroglobu- 
lin domain is isolated from the sea anemone Actinia equina. 

25 and having the amino acid sequence set forth in figure 1 or 
a functional derivative or a homologue thereof . 

7. The method of any of claims 1 to 4 , wherein the 
protein containing at least one type I repeated thyroglobu- 
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lin domain is a protein isolated from the eggs of chum 
salmon or a homologue or functional derivative thereof. 

8. The method of any of claims 1 to 7 , comprising 
inserting into the genome of the plant a sequence coding 
for a protein containing at least one type I repeated 
thyroglobulin domain with a promoter sequence active in the 
plant to cause expression of said protein at levels which 
provide an insect or nematode controlling amount of said 
protein. 

9. A method according to claim 8, further comprising 
the steps of : 

(a) culturing cells or tissues from the plant; 

(b) introducing into the cells or tissue at least one 
copy of a gene coding for the protein containing at least 
one type I repeated thryoglobulin domain ; 

(c) regenerating resistant whole plants from the cell 
or tissue culture . 

10 . The method according to claim 9 , which comprises 
the further step of sexually or clonally reproducing the 
whole plant in such a manner that at least one copy of the 
sequence coding for the protein containing at least one 
type I repeated thyroglobulin domain with a promoter 
sequence active in the plant is present in the cells of 
progeny of the reproduction. 

11 . The method according to claim 10, further compri- 
sing the steps of: 

(a) selecting a fertile , insect or nematode resistant 
plant prepared by the method of claim 10',- 

(b) sexually crossing the insect or nematode resistant 
plant with a plant from the insect or nematode susceptible 
plants from the susceptible variety; 
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(c) recovering reproductive material from the progeny 
of the cross and 

(d) growing resistant plants from the reproductive 
material . 

5 12. The method according to claim 11, for imparting 

insect or nematode resistance to a substantially homozygous 
population of plants of a susceptible variety, which 
comprises the further steps of repetitively: 

(a) backcrossing the insect or nematode resistant 
10 progeny with substantially homozygous, insect or nematode 

susceptible plants from the susceptible variety; and 

(b) selecting for expression of both insect or 
nematode resistance and the other characteristics of the 
susceptible variety among the progeny of the backcross, 

15 until the desired percentage of the characteristics of the 
susceptible variety are present in the progeny along with 
the insect or nematode resistance. 

13 . A transgenic plant and its sexual progeny 
resistant to attack by one or more insects or nematodes 

20 having digestive cysteine proteases, said transgenic plant 
expressing an insect of nematode controlling amount of a 
protein containing at least one type I repeated 
thyroglobulin domain. 

14. A biologically functional expression vehicle 
25 containing a promoter effective to promote expression of a 

downstream coding sequence in plant cells, a DNA coding 
region coding for the expression in plant cells of protein 
composed of at least one type I repeated thyroglobulin 
domain and a termination sequence effective to terminate 
30 transcription or translation of the genetic construction 
product in plant cells, the genetic construction effective 
to express in the cells of the plant insect controlling 
amounts of the protein containing at least one type I 
repeated thryoglobulin domain. 
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15. The biologically functional expression vehicle of 
claim 14, wherein the DNA isolate encodes an amino acid 
sequence set forth in Figure 1 or a functional derivative 
thereof . 

16 . The biological functional expression vehicle of 
claim 14, wherein the expression vehicle is pCABl . 

17. A host cell transformed with a biologically 
functional expression vehicle of any one of claims 14 to 
16 . 

18. The transgenic host cell of claim 17, wherein the 
DNA sequence is controlled by a promoter effective to 
promote expression of a downstream coding sequence in a 
plant cell, the DNA sequence coding region for the 
expression in plant cells of protein composed of at least 
one type I repeated thyroglobulin domain and a termination 
sequence effective to terminate transcription or 
translation of the product in plant cells, the genetic 
construction product in plant cells, the genetic 
construction effective to express in the cells of the plant 
insect controlling amounts of the protein composed of at 
least one type I repeated thyroglobulin domain to control 
one or more insects having digestive cysteine proteases. 

19. The method of any of claims 1 to 7 , wherein an 
agricultural composition containing a carrier and an insect 
or nematode control 1 ing or combat t ing amount of the 
cysteine protease inhibitor is applied to the locus. 

20. An agricultural composition containing a carrier 
and an insect or nematode controlling or combatting amount 
of a cysteine protease inhibitor as defined in claims 1, 5, 
6 or 7 as an active ingredient. 

21 . A type I repeated thyroglobulin domain inhibitor 
peptide with activity towards aspartic proteases, said 
peptide having the amino acid sequence extending from amino 
acid position 68-199 of equistatin of Fig. 1 or a modified 
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type I repeated thyreoglobulin aspartic protease inhibitor 
peptide wherein said modified peptide comprises a peptide 
having substantial amino acid identity to amino acid 
position 68-199 of equistatin; truncations of amino acid 

5 position 68-199 of equistatin; or truncations of the 
peptide having substantial amino acid identity to amino 
acid position 68-199 of equistatin, wherein said modified 
peptide is functionally equivalent to said amino acid 
position 68-199 of equistatin with aspartic protease 

10 inhibitor activity. 

22. A method of protecting a plant or a part of said 
plant against insect or nematode infestation by one or more 
insects or nematodes having digestive aspartic proteases, 
comprising presenting to a locus wherein said insect (s) or 
15 nematode (s) is (are) to be controlled an inhibitory amount 
of an aspartic protease inhibitor as defined in claim 21. 

23 . A method of protecting a plant or a part of said plant 
against insect or nematode infestation by one or more 
insects or nematodes having digestive cysteine and aspartic 

20 proteases , comprising presenting to a locus wherein said 
insect (s) or nematode (s) is (are) to be controlled an 
inhibitory amount of an aspartic protease inhibitor as 
defined in claim 21 and a cysteine protease inhibitor 
selected from the group of proteins containing at least one 

25 type I repeated thyroglobulin domain. 
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FIG. 1 

1 CTATGGCTCTTAGCCAAAACCAAGCCAAGTTTTCCAAAGGATTCGTCGTGATGATTTGG 
-32 MALSQNQAKFS KGFVVM IW 

60 GTACTATTCATTGCTTGTGCTATAACTTCAACTGAAGCTAGTCTAACCAAATGCCAACAG 
5-13 VLFIACAITSTEASLTKCQQ 

-1 +1 

120 CTCCAGGCCTCGGCTAACAGTGGTCTGATAGGTACTTATGTACCACAATGCAAAGAAACG 
8 LQASANSGLIGTYVPQC KET 

180 GGAGAGTTCGAAGAAAAACAATGCTGGGGATCGACTGGTTACTGTTGGTGTGTGGATGAA 
10 28 GEFEEKQCWGSTGYCWCVDE 

24 0 GATGGAAAAG AGATTCTAGGAAC CAAGATCCGTGGATCT C CGGATTGCAG C CGC AGAAAA 
48 DGKEILGTKIRGSPDCSRRK 

30 0 GCCGCGTTAACACTTTGCCAGATGATGCAAGCCATCATTGTTAATGTCCCTGGTTGGTGT 
68 AALTLCQMMQAI IVNVPGWC 

15 360 GGCCCTCCATCGTGTAAAGCTGACGGCAGTTTTGACGAGGTTCAGTGCTGCGCAAGTAAT 
88 GPPSCKADGSFDEVQCCASN 

42 0 GGAGAATGCTACTGTGTGGATAAGAAAGGAAAAGAACTTGAAGGCACAAGACAACAGGG A 
108 GECYCVDKKGKELEGTRQQG 

4 80 AGGCCAACCTGCGAAAGACACCTAAGCGAATGCGAGGAAGCTCGAATCAAGGCGCATTCA 
20 128 RPTCERHLSECEEARI KAHS 

540 AAC AGTCTTCGTGTTGAGATGTTCGTGCCAGAGTGTTTAGAAGATGGATCATATAACCCA 
148 NS LRVEMFVPECLEDGS YNP 

600 GTACAGTGCTGGCCTAGCACAGGATACTGTTGGTGCGTCGATGAAGGAGGGGTAAAGGTA 

168 VQCWPSTGYCWCVDEGGVKV 

25 

660 CCAGGTTCCGATGTCAGATTTAAACGCCCCACATGCTAAGAAAAACACAGTGAACAAAGT 

188 PGSDVRFKRPTC 

199 

720 GGCTAGTTTCCAGATCGAAAATAACTACAAAGGATTAATAAAATGTTAAAATAATTTCTC 

30 7 80 AATTCGGCTGTGATATATTTTTTCCAAGATAATTTAATCTGCATGTAGTTAACAGAAAAC 

84 0 AATCTCAACTAGAAATAAAGACTACGGTAATAATGACAAAAAAAAAAAA 
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Figure 3 
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Figure 4a 
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FIG. 7 
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FIG. 8 
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